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CRYSTAL STRUCTURE OF AN ANGIOTENSIN-CONVERTING ENZYME (ACE) 
AND USES THEREOF 

FIELD OF INVENTION 

5 The present invention relates to a crystal. In particular, the present invention relates 
to a crystal of ACE protein. 

The present invention further relates to methods, processes, ACE modulators, 
pharmaceutical compositions and uses of the ACE crystal and the structure co- 
1 0 ordinates thereof. 

BACKGROUND TO THE INVENTION 

Angiotensin converting enzyme (peptidyl dipeptidase A, EC 3.4.15.1, ACE) is a 
15 membrane-anchored dipeptidyl carboxypeptidase that is essential for blood pressure 
regulation and electrolyte homeostasis via the renin-angiotensin-aldosterone system. 
The enzyme is a zinc metalloprotease that converts the inactive peptide angiotensin I 
to angiotensin II, a potent vasoconstrictor. ACE, like many diverse membrane-bound 
ectoproteins, is released from the membrane by a membrane protease or secretase (1). 
20 An understanding of this cleavage-secretion mechanism is important for the 
development of therapeutic strategies to address the different pathologies caused by 
defects in the function of the secretase. Substrate determinants that specify cleavage 
by secretases remain incompletely characterised, but may include the physico- 
chemical properties of the juxtamembrane ("stalk") sequence or unidentified 
25 recognition motifs of the stalk or the extracellular domain. Cleavage of ACE occurs 
in the stalk sequence and the solubilizing protease is constrained topologically, in 
terms of the number of residues between the cleavage site and the proximal 
extracellular domain of ACE. However, the ACE secretase appears to be remarkably 
versatile in terms of its substrate specificity (2,3). 

30 

The active sites of ACE and carboxypeptidase A, a prototypic zinc metalloprotease, 
are understood to be very similar and this similarity is exploited in the design of the 
first generation of ACE inhibitors. The clinical success of these inhibitors - such as 
captopril and enalapril - in the treatment of hypertension and congestive heart failure 
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is well established. However, the side effects such as persistent cough which effects 
up to 20 % of patients and angioedema which is less common, together with 
limitations such as their contraindication in patients with impaired renal function and 
decreased efficacy in patients with low-renin hypertension, underscore the need for 
5 more specific and selective inhibitors. 

There are two isoforms of ACE that are transcribed from the same gene in a tissue 
specific manner. In somatic tissues it exists as a glycoprotein composed of a single 
large polypeptide chain of 1277 amino acids whereas the germinal form is synthesised 

10 as a lower molecular mass isozyme and is thought to play a role in sperm maturation 
and the binding of sperm to the oviduct epithelium. The somatic form consists of two 
domains (N- and C-domain), each containing an active site with a conserved HEXXH 
zinc-binding motif and a glutamate some 24 residues downstream which forms the 
third zinc ligand (Williams et al., 1994). The two domains differ in their substrate 

15 specificities; inhibitor and chloride activation profiles; and physiological functions. 
There are two N-domain-specific substrates: the hemoregulatory peptide iV-acetyl- 
seryl-aspartyl-lysyl-proline (AcSDKP) which controls hematopoietic stem cell 
differentiation and proliferation; and the bradykinin potentiating peptide angiotensin- 
(1 -7). On the other hand, both active sites catalyse the hydrolysis of angiotensin I and 

20 the vasodilator bradykinin with similar efficiency. However, inhibition of the N- 
domain with a phosphinic peptide RXP407 has no effect on blood pressure regulation 
(Junot et al., 2001) and, furthermore, expression of the N-domain only, in transgenic 
mice produced a phenotype similar to the ACE knockout mice (Esther et al., 1997). 
Thus, the C-domain appears to be necessary and sufficient for controlling blood 

25 pressure and cardiovascular function. Testis ACE (tACE) is identical to the C- 
terminal half of somatic ACE, except for a unique 36-residue sequence constituting its 
amino terminus, thus this isoform is selected for initial efforts to obtain a three- 
dimensional structure. 

30 The cDNA sequence of human testicular ACE has been described (Ehlers et al. (1989) 
Proa Nat. Acad Sci 86: 7741-7745) and the predicted protein consists of a 732-residue 
preprotein including a 31-residue signal peptide. The mature polypeptide has a 
molecular weight of 80,073 (unglycosylated form). 
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Despite the pivotal role of ACE, there have been no reports disclosing that suitable 
crystals have been or could be obtained for this enzyme and so the X-ray 
crystallographic analysis of such proteins has been impossible. 

SUMMARY OF THE INVENTION 

The present invention is based upon the seminal finding of the first three-dimensional 
structure of the ACE protein. 

Peptidases, for example, thermolysin and carboxypeptidase A that have been used in 
comparative molecular field analysis and 3D quantitative structure-activity 
relationship studies of ACE (Waller & Marshall, 1993) show no homology with the 
structure of ACE described herein. Surprisingly, ACE shows significant structural 
homology with that of neurolysin, a member of the M3 family of thimet 
oligopeptidases. The two proteins do not share any amino acid sequence identity 
(close to random score), yet when the two structures are optimally superimposed 
using DALI server (Holm & Sander, 1999), there is noticeable match with a root 
mean square (r.m.s.) deviation of 4.0 A for 143 C a atoms. Accordingly, the structure 
presented herein may be used for the development of novel, highly selective ACE 
modulators with the potential for greater efficacy, fewer side effects and treatment of 
new indications. 

Thus, in a first aspect, the present invention relates to a crystal of ACE protein. 
Preferably, the ACE protein is underglycosylated. 

Preferably, the ACE protein is underglycosylated by removing one or more 
glycosylation sites and/or one or more partially glycosylated sites. More preferably, 
the underglycosylated ACE protein is deglycosylated at amino acids 337 and 586 or 
amino acids 90, 109, 155, 337 and 586 of SEQ ID No 2. 

Preferably, the crystal comprises atoms arranged in a spatial relationship represented 
by at least a portion of the structure co-ordinates of Table A or Table B. 
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Preferably, the crystal belongs to the space group P2j2i2i. 

Preferably, the crystal has the unit cell dimensions: a= 56.47 A, b=84.90 A and 
c=133.99 A. 

5 

Preferably, the crystal is a crystal of human ACE protein. 

Preferably, the crystal further comprises an entity bound to the ACE protein or a 
portion thereof. More preferably, the entity is bound to the ACE protein or a portion 
10 thereof by contacting one or more residues of the ACE protein selected from: His384, 
Ala385, Lys542, Tyr551, Tyr554, Glu415 and His544. More preferably, the entity 
modulates the activity of ACE. More preferably, the entity is an inhibitor of ACE - 
such as lisinopril or a derivative thereof. 

15 In a second aspect, the present invention relates to a method of preparing a crystal of 
ACE protein comprising the steps of: (a) culturing host cells comprising an 
underglycosylated ACE protein; (b) purifying the underglycosylated ACE protein; 
and (c) crystallising the underglycosylated ACE protein. 

20 Preferably, the ACE protein is underglycosylated by removing one or more 
glycosylation sites and/or one or more partially glycosylated sites. More preferably, 
the underglycosylated ACE protein is deglycosylated at amino acids 337 or amino 
acids 90, 109, 155, 337 and 586 of SEQ ID No 2. 

25 Preferably, the ACE protein is crystallised using about 10 mM HEPES and about 
0.1% PMSF with an equal volume of a reservoir solution containing about 15 % PEG 
4000, about 50 mM CH 3 COONa.3H 2 0 pH 4.7 and about 10 fiM ZnS0 4 .7H 2 0. 

Preferably, the crystal that is prepared has a structure defined by at least a portion of 
30 the structure co-ordinates of Table A or Table B. 

Preferably, the crystal belongs to the space group P2j2i2i. 
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Preferably, the crystal has the unit cell dimensions: a= 56.47 A, b=84.90 A and 
c=133.99A. 

Preferably, the ACE protein is human ACE protein. 

5 

Preferably, the crystal further comprises an entity bound to the ACE protein. More 
preferably, the entity modulates the activity of ACE. More preferably, the entity is an 
inhibitor of ACE - such as lisinopril or a derivative thereof. More preferably, the 
crystal that is prepared has a structure defined by at least a portion of the structure co- 
10 ordinates of Table B. 

In a third aspect, the present invention relates to a method of screening for a 
modulator of ACE wherein the method comprises the use of a crystal according to the 
present invention. Preferably, the method comprises the steps of: (a) providing at 

15 least a portion of the structure co-ordinates of Table A or Table B; (b) employing at 
least a portion of the structure co-ordinates of Table A or Table B to design or select 
or synthesise a putative modulator of ACE; (c) contacting the putative modulator of 
ACE with ACE or a mutant, variant, homologue, derivative or fragment thereof in the 
presence of a substrate; and (d) screening the putative modulator of ACE in an assay 

20 for the potential to modulate ACE. 

Preferably, at least a portion of the structure co-ordinates of Table A or Table B and/or 
the putative modulator of ACE and/or the substrate are provided on a machine-readable 
data storage medium comprising a data storage material encoded with machine 
25 readable data. 

Preferably, the putative ACE modulator is from a library of compounds. 

Preferably, the putative ACE modulator is selected from a database. 

30 

Preferably, the putative ACE modulator is designed de novo. 



Preferably, the putative ACE modulator is designed from a known ACE modulator. 
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Preferably, the design or selection of the putative ACE modulator is performed in 
conjunction with computer modelling. 

Preferably, the putative ACE modulator is useful in the prevention and/or treatment of an 
5 ACE related disorder. More preferably, the ACE related disorder is hypertension. 

In a fourth aspect, the present invention relates to a process comprising the steps of: (a) 
performing the method according to the third aspect of the present invention; (b) 
identifying one or more modulators of ACE; and (c) preparing a quantity of those one or 
1 0 more ACE modulators. 

In a fifth aspect, the present invention relates to a process comprising the steps of: (a) 
performing the method according to the third aspect of the present invention; (b) 
identifying one or more ACE modulators; and (c) preparing a pharmaceutical 
15 composition comprising those one or more identified ACE modulators. 

In a sixth aspect, the present invention relates to a process comprising the steps of: (a) 
performing the method according to the third aspect of the present invention; (b) 
identifying one or more ACE modulators; (c) modifying those one or more ACE 
20 modulators; and (d) optionally preparing a pharmaceutical composition comprising 
those one or more ACE modulators. 

In a seventh aspect, the present invention relates to a method of obtaining structural 
information about a molecule or a molecular complex of unknown structure by using at 

25 least a portion of the structure co-ordinates of ACE, comprising the steps of: (a) 
generating X-ray diffraction data from a crystallised molecule or molecular complex; (b) 
applying at least a portion of the structure co-ordinates of ACE to said X-ray diffraction 
pattern to generate a three dimensional electron density map of at least a portion of the 
molecule or molecular complex; and (c) using all or a portion of the structure co- 

30 ordinates of ACE to generate homology models of ACE. 
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In an eighth aspect, the present invention relates to an ACE modulator identified by 
the method according to the third aspect of the present invention. Preferably, the ACE 
modulator inhibits ACE. 

5 In a ninth aspect, the present invention relates to a pharmaceutical composition 
comprising an ACE modulator according to the seventh aspect of the present 
invention and a pharmaceutically acceptable carrier, diluent, excipient or adjuvant or 
any combination thereof. 

10 In a tenth aspect, the present invention relates to a method of preventing and/or 
treating an an ACE related disorder comprising administering a modulator of ACE 
according to the seventh aspect of the present invention and/or a pharmaceutical 
according to the eighth aspect of the present invention, wherein said modulator of 
ACE or said pharmaceutical is capable of causing a beneficial preventative and/or 

15 therapeutic effect 

In an eleventh aspect, the present invention relates to a computer for producing a 
three-dimensional representation of ACE wherein said computer comprises: (a) a 
computer-readable data storage medium comprising a data storage material encoded 

20 with computer-readable data, wherein said data comprises the structure co-ordinates 
of ACE; (b) a working memory for storing instructions for processing said computer- 
readable data; (c) a central-processing unit coupled to said working memory and to 
said computer-readable data storage medium for processing said computer-machine 
readable data into said three-dimensional representation; and (d) a display coupled to 

25 said central-processing unit for displaying said three-dimensional representation. 

In a twelfth aspect, the present invention relates to a machine-readable data storage 
medium comprising a data storage material encoded with machine-readable data, 
wherein the data is defined by at least a portion of the structure co-ordinates of ACE 
30 in Table A or Table B. 
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In a thirteenth aspect, the present invention relates to the use of an ACE crystal in the 
preparation of a medicament to prevent and/or treat an ACE related disorder. 
Preferably, the ACE related disorder is hypertension. 

5 In a fourteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B to screen for modulators of ACE. 

In a fifteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B to solve the structure of the crystalline 
10 form of any other protein with significant amino acid sequence homology to any 
functional domain of ACE. 

In a sixteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B in molecular design techniques to design, 
1 5 select and synthesise modulators of ACE. 

In a seventeenth aspect, the present invention relates to the use of at least a portion of 
the structure co-ordinates of Table A or Table B in the development of compounds 
that can isomerise to reaction intermediates in the chemical reaction of a substrate or 
20 other compound that binds to ACE. 

In an eighteenth aspect, the present invention relates to the use of at least a portion of 
the structure co-ordinates of Table A or Table B to screen small molecule databases 
for chemical entities or compounds that modulate ACE. 

25 

In a nineteenth aspect, the present invention relates to the use of at least a portion of the 
structure co-ordinates of Table A or Table B to solve the structure of the crystalline 
form of any other protein with significant amino acid sequence homology to any 
functional domain of ACE. Preferably, the structure of the crystalline form of any 
30 other protein with significant amino acid sequence homology to any functional 
domain of ACE is solved using molecular replacement. 

In a twentieth aspect, the present invention relates to the expression vectors pLEN- 
tACEA36g(l, 2, 3, 4) and pLEN- tACEA36g(l,3). 
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DETAILED DESCRIPTION OF THE INVENTION 
ACE PROTEIN 

5 

ACE (EC 3.4.15.1) is a peptidyldipeptide hydrolase. It catalyzes the hydrolysis of the 
penultimate peptide bond at the C-tenninal end of a variety of acylated tripeptides and 
larger polypeptides having an unblocked alpha-carboxy group having a free C- 
terminus. The reactivity of the enzyme varies markedly depending on the substrate. 

10 

As used herein, the term "ACE protein" includes all vertebrate and mammalian forms 
and is intended to cover mutants, variants, homologues, derivatives and fragments 
thereof. Preferably, the mutants, variants, homologues, derivatives and fragments 
thereof have the activity of the naturally occurring ACE. 

15 

There are two isoforms of ACE that are transcribed from the same gene in a tissue 
specific manner. In somatic tissues, it exists as a glycoprotein composed of a single 
large polypeptide chain of 1277 amino acids whereas the germinal form is synthesised 
as a lower molecular mass isozyme and is thought to play a role in sperm maturation 

20 and the binding of sperm to the oviduct epithelium. The somatic form consists of two 
domains (N- and C-domain), each containing an active site with a conserved HEXXH 
zinc-binding motif and a glutamate some 24 residues downstream which forms the 
third zinc ligand (Williams et al., 1994). The two domains differ in their substrate 
specificities; inhibitor and chloride activation profiles; and physiological functions. 

25 There are two N-domain-specific substrates: the hemoregulatory peptide iV-acetyl- 
seryl-aspartyl-lysyl-proline (AcSDKP) which controls hematopoietic stem cell 
differentiation and proliferation; and the bradykinin potentiating peptide angiotensin- 
(1-7). On the other hand, both active sites catalyse the hydrolysis of angiotensin I and 
the vasodilator bradykinin with similar efficiency. However, inhibition of the N- 

30 domain with a phosphinic peptide RXP407 had no effect on blood pressure regulation 
(Junot et al., 2001) and, furthermore, expression of the N-domain only, in transgenic 
mice produced a phenotype similar to the ACE knockout mice (Esther et al., 1997). 
Thus, the C-domain appears to be necessary and sufficient for controlling blood 
pressure and cardiovascular function. Testis ACE (tACE) is identical to the C- 
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terminal half of somatic ACE, except for a unique 36-residue sequence constituting its 
amino terminus. 



Further background teachings on ACE have been presented by Victor A. McKusick et 
ah at http://www3.ncbi.r^ The following 

information concerning ACE has been extracted from that source. 



Angiotensin I-converting enzyme or kininase II, is a dipeptidyl carboxypeptidase that plays an 
important role in blood pressure regulation and electrolyte balance by hydrolyzing angiotensin 
I into angiotensin II, a potent vasopressor, and aldosterone-stimulating peptide. The enzyme is 
also able to inactivate bradykinin, a potent vasodilator. 

Ehlers et al. (1989) Proc. Nat Acad Sci. 86: 7741-7745 determined the cDNA sequence for 
human testicular ACE. The predicted protein is identical, from residue 37 to its C terminus, to 
the second half or C-terminal domain of the endothelial ACE sequence. The inferred protein 
sequence consists of a 732-residue preprotein including a 31-residue signal peptide. Hie 
mature polypeptide has a molecular weight of 80,073. 

Howard et al. (1990) Mol Cell Biol 10: 4294-4302, found that the testis-specific form of 
ACE has its own promoter within intron 12, is encoded by the 3-prime region of the gene, and 
is found only in postmeiotic spermatogenic cells and sperm. 

Although angiotensin-converting enzyme has been studied primarily in the context of its role 
in blood pressure regulation, this widely distributed enzyme has many other physiologic 
functions. The ACE gene encodes 2 isozymes. The somatic ACE isozyme is expressed in 
many tissues, including vascular endothelial cells, renal epithelial cells, and testicular Leydig 
cells, whereas the testicular or germinal ACE isozyme is expressed only in sperm Ramaraj et 
al., (1998) J. Clin. Invest 102: 371-378. 

Brown et al. (1996) Clin. Pharmacol Therapeutics 60: 8-13 found an association between the 
use of certain ACE inhibitors (lisinopril or enalapril vs captopril) and emergent angioedema in 
the African- American population of Tennessee. The adjusted relative risk of angioedema was 
4.5 (95% CI, 2.9-6.8) in blacks compared to whites. The African-American patients were 
more severely affected: 7 of the 8 patients admitted to the intensive care unit were black, as 
were all patients who required intubation. African-American users of ACE inhibitors tended to 
be younger and female when compared to their white counterparts. The rate of angioedema 
was highest within the first 30 days of use (5.79 per 1000 patient-years) compared to long- 
term use (1 .04 per 1000 patient-years). 

Large-scale trials of therapy for heart feilure showed improvements in outcome with ACE 
inhibitors and beta-blockers. These results led to the recommendation that all patients who 
have heart failure accompanied by a low ejection fraction and who can tolerate ACE inhibitors 
and beta-blockers should be treated with both agents. Exner et al. (2001) New Eng. J. Med 
344: 1351-1357 focused on the fact that black patients with heart failure have a poorer 
prognosis than white patients and performed a study comparing racial groups. They found that 
whereas therapy with enalapril is associated with significant reduction in the risk of 
hospitalization for heart failure among white patients with left ventricular function, it had no 
such effect in similar black patients. The explanation for the lesser response to the ACE 
inhibitor in black patients was not clear. 

Mattei et al. (1989) Cytogenet Cell Genet 51: 1041 assigned the ACE gene to 17q23 by in 
situ hybridization. Using a DNA marker at the growth hormone gene locus, which they 
characterized as 'extremely polymorphic* and which showed no recombination with ACE, 
Jeunemaitre et al. (1992) Nature Genet 1: 72-75, mapped ACE to 17q22-q24, consistent with 
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the in situ hybridization mapping to 17q23. A demonstration of linkage between the ACE 
locus and elevated blood pressure in a rat model of hypertension pointed to ACE as a 
candidate gene in human hypertension. In studies of hypertensive families, they found no 
evidence to support linkage between the ACE locus and the disease, however. 

Krege et al. (1995) Nature 375: 146-148 investigated the role of the ACE gene in blood 
pressure control and reproduction using mice generated to cany an insertional mutation that 
was designed to inactivate both forms of Ace. All homozygous female mutants were found to 
be fertile, but the fertility of homozygous male mutants was greatly reduced. Heterozygous 
males but not females had blood pressures that were 15 to 20 mm Hg less than normal, 
although both male and female heterozygotes had reduced serum Ace activity. 

Although significant ACE activity is found in plasma, the majority of the enzyme is bound to 
tissue such as vascular endothelium. Esther et al. (1997) . J. Clin. Invest 99: 2375-2385, used 

15 targeted homologous recombination to create mice expressing a form of ACE that lacks the C- 

terminal half of the molecule. This modified ACE protein was catalytically active but entirely 
secreted from cells. Mice that expressed only this modified ACE had significant plasma ACE 
activity but no tissue-bound enzyme. These animals had low blood pressure, renal vascular 
thickening, and a urine-concentrating defect. The phenotype was very similar to that of 

20 completely ACE-deficient mice previously reported, except that the renal pathology was less 

severe. These studies strongly supported the concept that the tissue-bound ACE is essential for 
the control of blood pressure and the structure and function of the kidney. 

ACE gene knockout mice lack both isozymes and exhibit low blood pressure, kidney 
25 dysfunctions, and male infertility. Ramaraj et al. (1998) J. Clin. Invest 102: 371-378, reported 

the use of a sperm-specific promoter and interbreeding of transgenic and gene knockout mice 
for generating a mouse strain that expressed ACE only in sperm. The experimental mice 
maintained the kidney defects of ACE -/- mice, but unlike the knockout strain, the males were 
fertile. Thus, Ramaraj et al. (1998) established that the role of ACE in male fertility is 
30 completely dependent on its exclusive expression in sperm. Their study demonstrated how 

transgenic and knockout techniques can be combined for ascribing a specific physiologic 
function to the expression of a multifunctional protein in a given tissue. 

Hagaman et al. (1998) Proc. Nat Acad. Set 95 : 2552-2557 used transgenic mice lacking 
35 somatic and testis ACE to investigate the male fertility defect They demonstrated that mice 

lacking both somatic and testis ACE isozymes have defects in sperm transport within the 
oviducts and in binding to zonae pellucidae. Males generated by gene targeting experiments 
that lack somatic ACE but retain testis ACE are fertile. Both male and female mice lacking 
angiotensinogen have normal fertility. The authors found that males heterozygous for the 
mutation inactivating both ACE enzymes had offspring of wildtype and heterozygous 
genotypes at the same frequency, suggesting that sperm carrying the mutation are not at a 
selective disadvantage. 

Nephropathy of type I diabetes is associated with the D allele of the insertion/deletion (I/D) 
polymorphism in intron 16 of the ACE gene. The D allele determines higher enzyme levels. 
To address causality underlying this association, Huang et al. (2001) Proc. Nat. Acad. Set 98: 
13330-13334 induced diabetes in mice having 1, 2, or 3 copies of the Ace gene, normal blood 
pressure, and an enzyme level range (65-162% of wildtype) comparable to that seen in 
humans. Twelve weeks later, the 3-copy diabetic mice had increased blood pressures and 
50 overt proteinuria. Proteinuria was correlated to plasma ACE level in the 3-copy diabetic mice. 

Thus, a modest genetic increase in ACE levels was sufficient to cause nephropathy in diabetic 



40 



45 



mice. 



55 



CRYSTAL 

In one aspect of the present invention, there is provided a crystal of ACE protein. 
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As used herein, the term "crystal" means a structure (such as a three dimensional (3D) 
solid aggregate) in which the plane faces intersect at definite angles and in which 
there is a regular structure (such as internal structure) of the constituent chemical 
species. Thus, the term "crystal" can include any one of: a solid physical crystal form 
5 such as an experimentally prepared crystal, a 3D model based on the crystal structure, 
a representation thereof - such as a schematic representation thereof, a diagrammatic 
representation thereof, or a data set thereof for a computer. 

The crystals of the present invention may be prepared by purifying ACE protein and 
10 then crystallising the purified protein. The ACE protein may also be prepared by 
expressing a nucleotide sequence encoding the ACE protein in a suitable host cell. 

In a preferred embodiment, the crystals of the present invention are prepared by 
purifying underglycosylated ACE protein and then crystallising the purified 
15 underdeglycosylated protein. The underdeglycosylated ACE protein may also be 
prepared by expressing a nucleotide sequence encoding the underdeglycosylated ACE 
protein in a suitable host cell. 

ACE may be purified using various methods known to a person skilled in the art, for 
20 example, from conditioned media by affinity chromatography on a Sepharose-28- 
lisinopril afSnity resin (Yu et al 1997). The protein may be quantified by amino acid 
analysis and assayed for activity using the substrate hippuryl-L-histidyl-L-leucine, as 
described previously (Ehlers, MRE, Chen, Y -N, Riordan, JF (1991) Proa Natl. Acad 
Sci. USA 88, 1009-1013). 

25 

The purified ACE proteins may be stored at -20 °C in 1 0 mM HEPES and 0. 1% PMSF. 

Concentration may be performed with the aid of a filtration system and the protein 
concentrate may be immediately used for crystallisation purposes. The protein 
30 concentrate may be crystallised using, for example, the vapour diffusion hanging drop 
method at a temperature of from about 1 °C to about 30 °C, preferably from about 4 °C 
to about 20 °C, more preferably at about 16 °C. The crystallisation temperature may be 
dependent on the additives present in the protein solution. 
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Typically, the best crystals for ACE proteins are grown at 16 °C by the vapour 
diffusion hanging drop method by mixing 2 jliI of the protein solution at -11.5 mg/ml 
in 10 mM HEPES and 0.1% PMSF with an equal volume of a reservoir solution 
5 containing 15 % PEG 4000, 50 mM CH 3 COONa.3H 2 0 pH 4.7 and 10 jjM 
ZnS04.7H20. Crystals usually appear within 2 weeks and grow to their maximum 
size after about a month. 

The design of ACE inhibitors has been based upon the assumption that the structure 
10 of ACE is related to that of peptidases - such as thermolysin (MA clan, M2 family) 
and carboxypeptidase A (MC clan, Ml 4 family) as evidenced by comparative 
molecular field analysis and 3D quantitative structure-activity relationship studies of 
ACE (Waller & Marshall, 1993). Marshall & Cramer (1988) Trends Pharamacol 
Set 9, 285-289 have also reported the development of predictive models for inhibitors 
15 of ACE and thermolysin. Surprisingly, the structure of ACE described herein, shows 
no structural homology with thermolysin but shows significant structural homology 
with neurolysis a member of the M3 family of thimet oligopeptidases. Therefore, the 
structure presented herein, may be effectively used for the development of novel, 
highly selective ACE modulators with the potential for greater efficacy, fewer side 
20 effects and treatment of new indications. In addition, the unanticipated similarity with 
neurolysin has shown the structural conservation amongst an merging family of 
peptidases with a common evolutionary origin. 

Without wishing to be bound by theory, it appears that the core structure of the two 
25 proteins is highly similar with different loop structures on the outer surface in the case 
of neurolysin. Indeed like ACE, neurolysin also belongs to the family of 
metallopeptidases bearing the HEXXH active site motif (Rawlings and Barrett, 1995; 
Brown et aL, 2001). The striking similarity also extends to the active site region in 
neurolysin consisting of a deep narrow channel that divides the molecule into two 
30 halves. It has been speculated that using the flexible secondary structure elements in 
the active site cavity, the neuropeptidase can effectively cleave a variety of small 
peptides. Likewise in ACE, the geometry of the active site groove clearly accounts for 
ACE's inability to hydrolyse large, folded substrates. Furthermore, the enzyme's 
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preference for oligopeptide substrates of about thirteen residues or less suggests that 
the substrate does not have the same freedom to extend outside of the channel during 
catalysis. Peptidases - such as thermolysin (MA clan, M4 family) and 
carboxypeptidase A (MC clan, Ml 4 family) - which have been used in comparative 
5 molecular field analysis and 3D quantitative structure-activity relationship studies 
(Waller & Marshall, 1993) show no structural homology with ACE. 

A crystal according to any one of the preceding claims comprising atoms arranged in 
a spatial relationship represented by at least a portion of the structure co-ordinates of 
10 Table A or Table B. 

Preferably, the crystal belongs to the space group Y2\2\2\ and has a unit cell with 
dimensions of: a= 56.47 A, b=84.90 A, c=l 33.99 A. 

15 Preferably, the crystal is a crystal of human ACE protein. 

Complexes may be obtained by growing the crystals in the presence of an entity - 
such as a test compound. In these experiments the protein solution is mixed with the 
entity and an equal volume of the reservoir solution before setting up the 
20 crystallisation. Single crystals suitable for diffraction work typically appear after 
about 4 weeks. 

Typically, the protein comprising ACE is purified to homogeneity for crystallisation. 
Purity of ACE may be measured by typical techniques such as SDS-PAGE, mass 
25 spectrometry and hydrophobic HPLC. 

The crystal structure of the invention may contain a portion - such as at least 25%, at 
least 50%, at least 75%, or preferably at least 90%, at least 95%, at least 98%, or at 
least 99% - of the co-ordinates listed in Table A or Table B. Preferably, the crystal 
30 structure of the invention contains all of the co-ordinates listed in Table A or Table B. 



Preferably, the crystal is usable in X-ray crystallography techniques. 
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Preferably, the crystals used can withstand exposure to X-ray beams used to produce 
dif&action pattern data necessary to solve the X-ray crystallographic structure. 

Preferably, prior to data collection, the crystals are flash-cooled at about 100 K in a 
cryoprotectant. More preferably, cryoprotectant contains 15 % PEG 4000, 50 mM 
CH 3 COONa.3H 2 0 pH 4.7, 10 uM ZnS0 4 .7H 2 0 and 25% glycerol. 

The X-ray data may be collected at a Synchrotron Radiation Source. Preferably, the 
X-ray data are collected at a Synchrotron Radiation Source at 100 °K. 

Preferably, the crystal has a resolution determined by X-ray crystallography of about 
3.5A or less, more preferably a resolution of about 2.8 A or less, more preferably, a 
resolution of about 2 A or less, more preferably, a resolution of about 1.5A or less, most 
preferably, 1 A or less. 

DEGLYCOSYLATION AND UNDERGLYCOSYLATION 

Many proteins in eukaryotic cells are glycoproteins that contain oligosaccharide 
chains covalently linked to certain amino acids. Glycosylation is known to affect 
protein folding, localisation and trafficking, protein solubility, antigenicity, biological 
activity and half-life, as well as cell-cell interactions. 

Protein glycosylation can be divided into four main categories mainly depending on the 
linkage between the amino acid and the sugar. These are N-linked glycosylation, O- 
linked glycosylation, C-mannosylation and GPI anchor attachments. N-glycosylation is 
characterised by the addition of a sugar to the amino group of an asparagine. In O- 
glycosylation, a sugar is attached to the hydroxyl group of a serine or threonine residue. 

For N-glycosylation, the sequence motif Asn-Xaa-Ser/Thr (wherein Xaa is any amino 
acid other than Pro) has been defined as a prerequisite for glycosylation. Although rare, 
the sequence motif Asn-Xaa-Cys can also be an acceptor site. N-glycans can be 
subdivided into three distinct groups called high mannose type', 'hybrid type', and 
'complex type', with the common pentasaccharide core - Manp(alphal,6> 
(Manp(alphal,3))-Manp(betal,4)-GlcpNAc(betal,4) GlcpNAc(betal,N)-Asn - occuring 
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in all three groups. The relationship between all three types can be ascribed to the fact 
that they originate from one precursor oligosaccharide which contains the described 
common pentasaccharide core Man3-GlcNAc2, and some additional sugar residues and 
the non-reducing end, and is then processed enzymatically to yield these three types. 
5 Since the hydroxyl group of Ser/Thr is thought to be involved in hydrogen bonding 
during the enzymatic attachment of the oligosaccharide precursor molecule to yield a 
favourable conformation for the action of the oligosaccharyltransferase, it has been 
suggested for proline that the steric hindrance might be too large (Kornfeld (1985) Ann. 
Rev. Biochem. 54: 631-64), preventing glycosylation at Pro containing sites. The 

10 negative influence of asparuc acid towards glycosylation can be ascribed to the negative 
charge on the side chain of this residue. In addition some cases have been reported 
where Ser/Thr is replaced by cysteine. While Ser replacement by Cys generally leads to 
decreased glycosylation, it has been shown (Kasturi 1995 J. Biol. Chem. 270: 14756-61) 
that substitution by Thr at a given potential glycosylation site can lead to increased 

15 glycosylation. This is in accordance with the model of hydrogen bonding being an 
important factor during the attachment of the precursor molecule to the protein. 
Although there are usually many potential glycosylation sites in a protein it has been 
estimated that glycosylation occurs only at one third of them. Mostly at those sites where 
the surrounding amino acids allow the formation of a beta turn. 

20 

Various glycoforms of ACE have been described. By way of example, Sadhukhan & 
Sen (3a) disrupted specific glycosylation sites in rabbit tACE to elucidate the 
glycosylation requirements for the expression and processing of active testis ACE. 
There are five potential iST-linked glycosylation sites in the rabbit tACE sequence, with 

25 an additional six in the somatic form (4a). A null mutant, where all five sites had 
been disrupted, behaved similarly to wild-type tACE expressed in the presence of the 
glycosylation-inhibitor, tunicamycin. It was degraded intracellularly and failed to be 
detected in culture medium, confirming previous findings that tACE requires W-linked 
glycosylation to be expressed in an active form (5a and 3a). Expression of the 

30 remaining mutants showed a preference for //-linked glycosylation at the N-terminus 
and that the presence of sugars at a single N-terminal site was necessary and sufficient 
to produce enzymaticaUy-active tACE that was solubiUsed. The presence of 
glycosylation is not site-specific, as mutants that have either the first site or second 
site intact are expressed and active. However, glycosylation at the third site alone is 
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not sufficient to produce active protein in HeLa cells, albeit this mutant was expressed 
in yeast (3a), indicating that the requirements for glycosylation are cell-specific. 

In human testis ACE 0-linked sugars are not necessary for expression, implicating the 
5 JV-linked sugars in this role (6a). JV-linked glycosylation of human tACE expressed in 
CHO cells at each site has been identified by MALDI-TOF mass spectrometry (7a). 
There are seven potential iV-linked sites in human tACE, five of which are 
complementary to the sites in rabbit tACE (7a). The unique sites lie within the 
ectodomain (the fourth site) and in the juxtamembrane stalk region, adjacent to the 
10 cleavage site (the seventh site). As with the rabbit form, there appears to be a 
preference for glycosylation at the N-terminus as evidenced by MALDI-TOF mass 
spectrometry of glyosylation sites (7a). Inhibition of complex oligosaccharide 
formation using a glucosidase I inhibitor JV-butyldeoxynojirimycin (NB-DNJ) led to 
the production of an active glycoform that was electrophoretically homogeneous (7a). 

15 

Suitably, the crystal of the ACE protein may comprise de-glycosylated ACE protein 
or a fragment thereof For example, the deglycosylated ACE may comprise the 
sequence presented as SEQ ID No. 2. 

20 To deglycosylate the ACE protein, various methods known to a person skilled in the 
art may be used. Both chemical and enzymatic methods may be used for removing 
oligosaccharides from glycoproteins. Hydrazinolysis of glycoproteins (Kuraya, N & 
Hase (1992) J Biochem (Tokyo) 112:122-126), is capable of removing bothN- and O- 
linked sugars, although this results in the complete destruction of the protein 

25 component and is therefore not suitable if recovery of the protein is desirable. Milder 
chemical methods such as trifluoromethanesulphonic acid (TFMS) may be used, 
however this may result in incomplete sugar removal and partial protein destruction. 
Other methods — such as site directed mutagenesis of glycosylated amino acids may 
also be used. 

30 

Suitably, enzymatic methods may be used which provide for complete sugar removal 
with no protein degradation. 
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Use of the enzyme PNGase F is an effective method of removing virtually all N-linked 
oligosaccharides from glycoproteins (Tarentino & Plummer (1994). Methods in 
Enzymol 230: 44-57). The oligosaccharide is left intact and therefore suitable for further 
analysis (the asparagine residue from which the sugar was removed is deaminated to 
aspartic acid, the only modification to the protein). 

Other commonly used endoglycosidases include Endoglycosidase H (Kobata (1979) 
Anal Biochem 100:1-14) and Endoglycosidase F (Trimble & Tarentino (1991) J. 
Biochem. 266:1646-1651). In a preferred method, the ACE protein is digested with 
Endoglycosidase H (30 mU) in a suitable buffer - such as 100 mM sodium phosphate, 
0.1 mM ZnCl 2 and 1% BSA, pH 6.0 for 16 h at 37°C. The endo H-treated protein is 
passed through a lectin affinity column consisting of equal parts of concanavalin A, 
wheat germ, and lentil lectin, after equilibration with 20 mM Tris-HCl, 0.5 M NaCl at 
pH 7.4. The deglycosylated ACE is collected in the flowthrough. Free 
oligosaccharides and any other impurities are removed from the flowthrough fraction 
by a final lisinopril-Sepharose affinity chromatography step. The homogeneity of the 
ACE protein after deglycosylation is confirmed by SDS-PAGE on a 4-20% 
acrylamide gel and MALDI-TOF mass spectrometry. 

Commercially available kits may also be used - such as the E-DEGLY kit (Sigma- 
Aldrich, UK) and the GlycoFree Deglycosylation Kit (Glyco, Novato, USA) which 
removes both N- and O-linked glycans from glycoproteins. 

Preferably, ACE is crystallised using underglycosylated ACE protein. 

As used herein, the term "underglycosylated" means that one or more of the 
oligosaccharide chains covalently linked to amino acids in the glycosylated protein 
are no longer present. 

By way of example only, testis ACE is glycosylated at amino acids 72, 90 and 109 
and glycosylated partially at amino acids 155, 337 and 586. Accordingly, the 
"underglycosylated" testis ACE may not be glycosylated a one or more of amino 
acids 72, 90, 109, 155, 337 and 586. 
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The underglycosylated ACE protein may be prepared using various methods - such as 
site directed mutagenesis and glycosylation inhibition methods. For example, 
glycosylation inhibition methods using NB-DNJ may prevent the formation of complex 
oligosaccharides. NB-DNJ inhibits glucosidase I and prevents maturation of the sugars. 

5 

Preferably, the underglycosylated ACE protein is prepared by culturing host cells in 
the presence of N-butyldeoxynojirimycin (NB-DNJ). 

Preferably, underglycosylated ACE protein is prepared using site-directed 
10 mutagenesis. More preferably, the underglycosylated ACE protein comprises a 
mutation at amino acid 337 of SEQ ID No. 2 or amino acids 90, 109, 155, 337 and 
586 of SEQ ID No. 2. 

Underglycosylated mutants may yield crystals in the orthorhombic, P2*2i2i space 
15 group that diffract to 2.8A or less. Moreover, crystals from a truncated mutant of 
tACE that only have simple high mannose oligosaccharides are also grown in the 
orthorhombic space group which may diffract to better than 2.0 A resolution. 

PREPARING A CRYSTAL OF ACE PROTEIN 

20 

In another aspect, the present invention relates to a method of preparing a crystal of ACE 
protein, comprising the steps of (a) culturing host cells comprising an underglycosylated 
ACE protein; (b) purifying the underglycosylated ACE protein; and (c) crystallising the 
underglycosylated ACE protein. 

25 

Preferably, the ACE protein is underglycosylated by removing one or more 
glycosylation sites and/or one or more partially glycosylated sites. More preferably, 
the underglycosylated ACE protein comprises a mutation at amino acid 337 of SEQ ID 
No 2 or amino acids 90, 109, 155, 337 and 586 of SEQ ID No 2. 

30 

The ACE protein may be underglycosylated and purified using the methods described 
herein. 



WO 2004/024765 PCT/GB2003/003966 

20 

Preferably, the ACE protein is crystallised using about 10 mM HEPES and about 
0,1% PMSF with an equal volume of a reservoir solution containing about 15 % PEG 
4000, about 50 mM CH 3 COONa.3H 2 0 pH 4.7 and about 10 nM ZnS0 4 .7H 2 0. 

5 Preferably, the ACE protein is crystallised in the presence of an entity, for example, a 
modulator of ACE. 

MODULATORS OF ACE 

10 The role of ACE in the pathogenesis of hypertension has resulted in a search for 
modulators (eg. inhibitors) of the enzyme that could act as antihypertensive drugs (eg. 
US 3,891,616, US 3,947,575, US 4,052,511 and US 4,053,651). Therapeutic 
vasodepressors - such as Captopril and D-2-methyl-3-mercaptopropanoyl-L-proline - 
have been synthesised as ACE inhibitors. Numerous synthetic peptide derivatives have 

15 also been shown to be ACE inhibitors as disclosed in US 3,832,337. 

Natural substances that inhibit ACE include snake venom and those derived from 
foodstuffs — such as ACE inhibiting peptides produced by enzymatic hydrolysis of 
proteins, such as casein or fish meat protein (by Hiroyuki Ukeda, Nippon Nogei Kagaku 
20 Kaishi (Journal of Japan Society for Bioscience, Biotechnology, and Agrochemistry, 
66(1), 25-29 (1992)). 

ACE inhibiting synthetic substances include captopril (D-2-methyl-3- 
mercaptopropanoyl-L-proline) which has already been put to practical application as an 
25 orally administered vasodepressor. 

However, many ACE inhibiting substances exhibit side effects in many cases and special 
attention needs be exercised in safety aspects. 

30 The present invention permits the use of molecular design techniques to design, select 
and synthesise chemical entities and compounds, including ACE modulating 
compounds, capable of binding to ACE, in whole or in part. 
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Thus, in a further aspect, the present invention relates to a method of screening for a 
modulator of ACE wherein the method comprises the use of a crystal of ACE. 

Preferably, the method comprises the steps of: (a) providing at least a portion of the 
structure co-ordinates of Table A or Table B; (b) employing at least a portion of the 
structure co-ordinates of Table A or Table B to design or select or synthesise a 
putative modulator of ACE; (c) contacting the putative modulator of ACE with ACE 
or a mutant, variant, homologue, derivative or fragment thereof in the presence of a 
substrate; and (d) screening the putative modulator of ACE in an assay for the 
potential to modulate ACE. 

By way of example, the structure co-ordinates may be used to design compounds that 
bind to the enzyme and may alter the physical properties of the compounds (eg. 
solubility) or the enzyme itself. This invention may be used to design compounds that 
act as modulators - such as competitive inhibitors - of ACE by binding to all or a 
portion of the active site of ACE. Compounds may also be designed that act as non- 
competitive inhibitors of ACE. These as non-competitive inhibitors may bind to all or 
a portion of ACE already bound to its substrate and may be more potent and specific 
than known ACE inhibitors that compete only for the ACE active site. Similarly, 
non-competitive inhibitors that bind to and inhibit ACE whether or not it is bound to 
another chemical entity may be designed using the structure co-ordinates of ACE as 
described herein. 

By way of example, it may be found that the COOH-binding active site residue differs 
between the N and C domain active sites and/or that it may be amenable to the 
incorporation of a functionality that can covalently modify this residue to produce an 
irreversible inhibitor design. It has long been assumed that the COOH-binding 
residue is a positively charged arginine (M. A. Ondetti & D. W. Cushman (1981) in 
Biochemical Regulation of Blood Pressure (R. L. Soffer, ed.), Wiley, New York, 165- 
204), but the ACE-lisinopril X-ray crystal structure described herein, shows that it is a 
lysine, in the case of the C-domain active site. This may present an opportunity for 
covalent modification, by, for example, the introduction of an alkyl halide or halo- 
ketone functionality into the inhibitors that can alkylate the lysine amine, or a-ketone 
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or aldehyde that can form a Schiffs base with the lysine amine, or the use of activated 
ester or thioester groups, or other modified carboxyl groups susceptible to 
nucleophilic attack. 

5 In a preferred embodiment, at least a portion of the structure co-ordinates of Table A 
or Table B and/or the putative modulator of ACE and/or the substrate are provided on 
a machine-readable data storage medium comprising a data storage material encoded 
with machine readable data. 

10 An ACE crystal may be probed with a variety of different chemical entities or test 
compounds to determine optimal sites for interaction between modulators of ACE and 
the enzyme. For example, X-ray diffraction data collected from crystals grown in the 
presence of chemical entities or test compounds may allow the elucidation of how the 
chemical entities or test compounds interact with ACE. Molecules that bind to those 

15 sites can then be designed and synthesised and tested for their ACE modulating 
activity. 

The present invention may also allow the development of compounds that can 
isomerise to reaction intermediates in the chemical reaction of a substrate or other 

20 compound that binds to ACE. Thus, the time-dependent analysis of structural 
changes in ACE during its interaction with other molecules may be performed. The 
reaction intermediates of ACE may also be deduced from the reaction product in co- 
complex with ACE. Such information is especially useful to design improved 
analogues of known ACE modulators or to design new ACE modulators based on the 

25 reaction intermediates of the ACE enzyme and ACE-modulator complex. This may 
provide a new route for designing ACE modulators with high specificity and stability. 
Preferably, this provides a new route for designing ACE modulators with high 
specificity, high stability and low toxicity. 

30 Small molecule databases or test compounds may be screened for chemical entities or 
compounds that can bind in whole, or in part, to ACE. Thus, in a preferred 
embodiment, the putative ACE modulator is from a library of compounds or a 
database. In this screening, the quality of fit of such entities or compounds to the 



WO 2004/024765 



23 



PCT/GB2003/003966 



binding site may be judged by various methods - such as shape complementarity or 
estimated interaction energy (Meng, E. C. et al, J. Comp. Chem., 13, pp. 505-524 
(1992)). 

5 Because ACE protein or a mutant, variant, homologue, derivative or fragment thereof 
may crystallise in more than one crystal form, the structure co-ordinates of ACE, or 
portions thereof, may be particularly useful to solve the structure of other crystal 
forms of ACE. They may also be used to solve the structure of ACE mutants, ACE 
variants, ACE homologues, ACE derivatives, ACE fragments and ACE complexes. 

10 

Preferably, the structure co-ordinates of ACE are used to solve the structure of the 
crystalline form of any other protein with significant amino acid sequence homology 
to any functional domain of ACE. By way of example, molecular replacement may 
be used. In this method, the unknown crystal structure, whether it is another crystal 
15 form of ACE, an ACE mutant, an ACE variant, an ACE homologue (eg. another 
protein with significant amino acid sequence homology to any functional domain of 
ACE), an ACE derivative, an ACE fragments or an ACE co-complex may be 
determined using the ACE structure co-ordinates of the present invention. This 
method will provide a more accurate structural form for the unknown crystal more 
quickly and efficiently than attempting to determine such information ab initio. 

In a preferred embodiment of the present invention, the ACE crystal further comprises 
an entity bound to the ACE protein or a portion thereof. For example, ACE may be 
crystallised in complex with an entity that is an inhibitor of ACE eg. lisinopril. 

Preferably, the entity is bound to the ACE protein or a portion thereof by contacting 
one or more residues of the ACE protein selected from: His384, Ala385, Lys542, 
Tyr551, Tyr554, Glu415 and His544. 

The crystal structures of a series of such complexes may then be solved by molecular 
replacement or in combination with MAD (Multiwavelength Anomalous Dispersion) 
and/or MIRAS (Multiple Isomorphous Replacement with Anomalous Scattering) 
procedures - and compared with that of wild-type ACE. Potential sites for 
modification within the binding sites of the enzyme may thus be identified. This 
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information provides an additional tool for determining the most efficient binding 
interactions, for example, increased hydrophobic interactions, between ACE and a 
chemical entity or compound. 

The structures and complexes of ACE may be refined using computer software - such 
as X-PLOR (Meth. Enzymol., vol. 114 & 115, H. W. Wyckoff et al., eds., Academic 
Press (1985)), MLPHARE (Collaborative computational project Number 4. The 
CCP4 Suite: Programs for Protein Crystallography (1994) Acta Crystallogr. D 50, 
760-763) and SHARP [De La Fortelle, E. & Bricogne, G. Maximum-likelihood 
heavy-atom parameters refinement in the MIR and MAD methods (1997) Methods 
Enzymol. 276, 472-494). Preferably, the complexes are refined using the program 
CNS (BrOnger et al (1998) Acta Crystallogr. D 54, 905-921). During the final stages 
of refinement water molecules, ions and inhibitor molecules may be inserted in the 
structure. This information may thus be used to optimise known classes of ACE 
modulators, eg. ACE inhibitors, and more importantly, to design and synthesise novel 
classes of ACE modulators. 

The overall figure of merit may be improved by iterative solvent flattening, phase 
combination and phase extension with the program SOLOMON [Abrahams, J. P. & 
Leslie, A. G. W. Methods used in structure determination of bovine mitochondrial Fl 
ATPase. (1 996) Acta Crystallogr. D 52, 110-119]. 

The structure co-ordinates of ACE mutants provided in this invention also facilitate 
the identification of related proteins or enzymes analogous to ACE in function, 
structure or both, thereby further leading to novel therapeutic modes for treating or 
preventing ACE related diseases. 

The design of compounds that bind to or modulate ACE according to the present 
invention generally involves consideration of two factors. First, the compound must 
be capable of physically and structurally associating with ACE. Non-covalent 
molecular interactions important in the association of ACE with its substrate may 
include hydrogen bonding, van der Waals and hydrophobic interactions. Second* the 
compound must be able to assume a conformation that allows it to associate with 
ACE. Although certain portions of the compound may not directly participate in the 
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association with ACE, those portions may still influence the overall conformation of 
the molecule. This may have a significant impact on potency. Such conformational 
requirements include the overall three-dimensional structure and orientation of the 
chemical entity or compound in relation to all or a portion of a binding site of ACE, or 
the spacing between functional groups of a compound comprising several chemical 
entities that directly interact with ACE. 

The potential modulating or binding effect of a chemical compound on ACE may be 
analysed prior to its actual synthesis and testing by the use of computer modelling 
techniques. If the theoretical structure of the given compound suggests insufficient 
interaction and association with ACE, then synthesis and testing of the compound 
may be obviated. However, if computer modelling indicates a strong interaction, the 
molecule may be synthesised and tested for its ability to bind to ACE and modulate 
(eg. inhibit) using the fluorescent substrate assay of Thornberry et ah (2000) Methods 
Enzymol. 322, pp 100-110. In this manner, synthesis of inactive compounds may be 
avoided. 

A modulating or other binding compound of ACE may be computationally evaluated 
and designed by means of a series of steps in which chemical entities or test 
compounds are screened and selected for their ability to associate ACE. 

A person skilled in the art may use one of several methods to screen chemical entities 
or test compounds for their ability to associate with ACE and more particularly with 
the individual binding sites of ACE. This process may begin by visual inspection of, 
for example, the active site on the computer screen based on the ACE co-ordinates of 
the present invention. Selected chemical entities or test compounds may then be 
positioned in a variety of orientations, or docked, with ACE. Docking may be 
accomplished using software such as Quanta and Sybyl, followed by energy 
m in imi sation and molecular dynamics with standard molecular mechanics force fields 
- such as CHARMM and AMBER. 

Specialised computer programs may also assist in the process of selecting chemical 
entities or test compounds. These include but are not limited to MCSS (Miranker and 
Karplus (1991) Proteins: Structure, Function and Genetics, 11, pp. 29-34); GRID 
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(Goodford (1985) J. Med. Chem., 28, pp. 849-857) and AUTODOCK (Goodsell and 
Olsen (1990), Proteins: Structure. Function, and Genetics, 8, pp. 195-202. 

Once suitable chemical entities or test compounds have been selected, they may be 
assembled into a single compound - such as an ACE modulator. Assembly may 
proceed by visual inspection of the relationship of the chemical entities or test 
compounds in relation to the structure co-ordinates of ACE. This may be followed by 
manual model building using software - such as Quanta, Sybyl or O [Jones, T. A., 
Zou, J. Y., Cowan, S. W. & Kjeldgaard, M. Improved methods for building protein 
models in electron density maps and the location of errors in these models (1991) 
Acta Crystallogr. A 47, 1 10-1 19]. 

Refinement of the model may be carried out using the program CNS [Brttnger, A. T. 
et al. Crystallography & NMR System: A new software suite for macromolecular 
structure determination. (1998) Acta Crystallogr. D 54, 905-921]. 

Various programs may be used by a skilled person to connect the individual chemical 
entities or test compounds - such as 3D Database systems (Martin (1992) J. Med. 
Chem., 35, pp. 2145-2154) and CAVEAT (Bartlett et al. (1989) Royal Chem. Soc. 78, 
pp. 182-196). 

Rather than build an ACE inhibitor one chemical entity at a time, modulating or other 
ACE binding compounds may be designed as a whole or de novo using either an 
empty binding site or optionally including some portion(s) of a known inhibitors). 
Such compounds may be designed using programs that may include but are not 
limited to LEGEND (Nishibata and Itai (1991) Tetrahedron, 47, p. 8985) and LUDI 
(Bohm (1992) J. Comp. Aid. Molec. Design, 6, pp. 61-78). 

Other molecular modelling techniques may also be employed in accordance with this 
invention - such as those described by Cohen et al., J. Med. Chem., 33, pp. 883-894 
(1990); Navia and Murcko (1992) Current Opinions in Structural Biology, 2, pp. 202- 
210 (1992). 
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Once a compound has been designed or selected by the above methods, the efficiency 
with which that compound may bind to ACE may be computationally evaluated. 
Specific computer software may be used to evaluate the efficiency of binding (eg. to 
evaluate compound deformation energy and electrostatic interaction) - such as 
QUANTA/CHARMM (Accelrys Inc., USA) and Insight II/Discover (Biosym 
Technologies Inc., San Diego, Calif., USA). These programs may be implemented, 
for instance, using a suitable workstation. Other hardware systems and software 
packages will be known to those persons skilled in the art. 

Once an ACE-modulating compound has been selected or designed, as described 
above, substitutions may be made (eg. in atoms or side groups) to improve or modify 
the binding properties. The substitutions may be conservative ie. the replacement 
group may have approximately the same size, shape, hydrophobicity and charge as the 
original group. Such substituted chemical compounds may then be analysed for 
efficiency of binding to ACE by the same computer methods described above. 

Test compounds and modulators of ACE etc. which are identified using the crystal 
and the methods of the present invention may be screened in assays. Screening can 
be, for example in vitro, in cell culture, and/or in vivo. Biological screening assays 
preferably centre on activity-based response models, binding assays (which measure 
how well a compound binds), and bacterial, yeast and animal cell lines (which 
measure the biological effect of a compound in a cell). The assays can be automated 
for high capacity-high throughput screening (HTS) in which large numbers of 
compounds can be tested to identify compounds with the desired activity. 

Current screening technologies are described in Handbook of Drug Screening, edited 
by Ramakrishna Seethala, Prabhavathi B. Fernandes. New York, NY, Marcel Dekker, 
(2001). 

ACE RELATED DISORDERS 

ACE related disorders include, but are not limited to, treatment of high blood pressure; 
treatment of heart failure; prolonging survival of patients who have had a heart attack; 
preventing death by heart attack and stroke in patients with vascular disease and in 
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diabetics with other vascular risk factors; prolonging survival of patients with weak heart 
muscle; helping leaking heart valves; preserving kidney function in diabetics; and the 
treatment of new indications (e.g. polycythemia). Special groups of patients may also be 
treated with ACE inhibitors, including: patients with chronic pulmonary disease; patients 
5 with scleroderma; patients with atheroschlerosis; and patients with hyperuricemia. 

ACE CONSTRUCTS 

The ACE proteins produced by a host recombinant cell may be secreted or may be 
10 contained intracellularly depending on the nucleotide sequence and/or the vector used. 

As will be understood by those of skill in the art, expression vectors containing an ACE 
encoding nucleotide sequence or a mutant, variant, homologue, derivative or fragment 
thereof, may be designed with signal sequences which direct secretion of the ACE 
15 coding sequences through a particular prokaryotic or eukaryotic cell membrane. 

The ACE encoding sequence may be fused (eg. ligated) to nucleotide sequences 
encoding a polypeptide domain which will facilitate purification of soluble proteins 
(Kroll DJet al (1993) DNA Cell Biol 12:441-53). Preferably, the polypeptide domain 

20 which facilitates purification of soluble proteins is fused in frame with the ACE 
encoding sequence. Such purification facilitating domains include, but are not limited 
to, metal chelating peptides - such as histidine-tryptophan modules that allow 
purification on immobilised metals (Porath J (1992) Protein Expr Purif 3, 263-281), 
protein A domains that allow purification on immobilised immunoglobulin, and the 

25 domain utilised in the FLAGS extension/affinity purification system (hnmunex Corp, 
Seattle, WA). The inclusion of a cleavable linker sequence such as Factor XA or 
enterokinase (Invitrogen, San Diego, CA) between the purification domain and ACE is 
useful to facilitate purification. 

30 Preferably, the ACE construct is pEE-ACEANJ which encodes human tACE that 
lacks the heavily 0-glycosylated, 36-residue N-terminal sequence and is truncated 
after Ser625, thereby lacking most of the juxtamembrane stalk as well as the 
transmembrane and cytoplasmic domains as described in (7a). 
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In another preferred embodiment, the ACE construct comprises an underglycosylated 
ACE, constructed by the removal of one or more glycosylation sites - such as one or 
more jV-linked glycosylation sites. Glycosylation may be abolished using various 
methods known to a person skilled in the art as previously described. Preferably, a 
truncated form of tACE (tACEA36 lacking the first N-terminal 36 residues as well as 
the cytoplasmic domain is used for the construction of mutants (7a) and mutagenic 
oligonucleotides then used for altering the sites that are glycosylated. Preferably, the 
nucleotide sequence of each fragment is confirmed by DNA sequencing to ensure that 
only the desired mutation is created. 

Preferably, the ACE construct comprising underglycosylated ACE is pLEN- 
tACEA36g(l, 2, 3, 4) or is pLEN-tACEA36g(l , 3). 

HOST CELLS 

As used herein, the term "host cell" refers to any cell that comprises nucleotide 
sequences that are of use in the present invention, for example, nucleotide sequences 
encoding ACE. 

Host cells may be transformed or transfected with a nucleotide sequence contained in 
a vector e.g. a cloning vector. Preferably, said nucleotide sequence is carried in a 
vector for the replication and/or expression of the nucleotide sequence. The cells will 
be chosen to be compatible with the said vector and may for example be prokaryotic 
(for example bacterial), fungal, yeast or plant cells. 

The gram-negative bacterium R coli is widely used as a host for cloning nucleotide 
sequences. This organism is also widely used for heterologous nucleotide sequence 
expression. However, large amounts of heterologous protein tend to accumulate 
inside the cell. Subsequent purification of the desired protein from the bulk of E. coli 
intracellular proteins can sometimes be difficult. 
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In contrast to E. coli, bacteria from the genus Bacillus are very suitable as 
heterologous hosts because of their capability to secrete proteins into the culture 
medium. Other bacteria suitable as hosts are those from the genera Streptomyces and 
Pseudomonas. 

5 

Depending on the nature of the polynucleotide and/or the desirability for further 
processing of the expressed protein, eukaryotic hosts including yeasts or other fungi 
may be preferred. In general, yeast cells are preferred over fungal cells because yeast 
cells are easier to manipulate. However, some proteins are either poorly secreted 
10 from the yeast cell, or in some cases are not processed properly (e.g. 
hyperglycosylation in yeast). In these instances, a different fungal host organism 
should be selected. 

Examples of expression hosts are fungi - such as Aspergillus species (such as those 
15 described in EP-A-01 84438 and EP-A-0284603) and Trichoderma species; bacteria - 
such as Bacillus species (such as those described in EP-A-01 34048 and EP-A- 
0253455), Streptomyces species and Pseudomonas species; yeasts - such as 
Kluyveromyces species (such as those described in EP-A-0096430 and EP-A- 
0301670) and Saccharomyces species; and mammalian cells - such as CHO-K1 cells. 

20 

The use of host cells may provide for post-translational modifications (eg. 
glycosylation) as may be needed to confer optimal biological activity on recombinant 
expression products of the present invention. 

25 Aspects of the present invention also relate to host cells comprising the ACE 
constructs of the present invention. The ACE constructs may comprise a nucleotide 
sequence for replication and expression of the sequence. The cells will be chosen to 
be compatible with the vector and may for example be prokaryotic (for example 
bacterial), fungal, yeast or plant cells. 

30 

In a preferred embodiment, the host cells are mammalian cells - such as CHO-K1 
cells. CHO-K1 cells expressing ACE may be grown and maintained in accordance 
withYuefa/. (1997). 
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NUCLEOTIDE SEQUENCES 

As used herein, the term "nucleotide sequence" refers to nucleotide sequences, 
oligonucleotide sequences, polynucleotide sequences and variants, homologues, 
5 fragments and derivatives thereof (such as portions thereof) which comprise the 
nucleotide sequences encoding ACE, for example, testis ACE or somatic ACE. 

The nucleotide sequence may be DNA or RNA of genomic or synthetic or 
recombinant origin, which may be double-stranded or single-stranded whether 
10 representing the sense or antisense strand or combinations thereof. 

Preferably, the term nucleotide sequence is prepared by use of recombinant DNA 
techniques (e.g. recombinant DNA). The nucleotide sequences may include within 
them synthetic or modified nucleotides. A number of different types of modification 
15 to oligonucleotides are known in the art. These include methylphosphonate and 
phosphorothioate backbones, addition of acridine or polylysine chains at the 3 1 and/or 
5' ends of the molecule. For the purposes of the present invention, it is to be 
understood that the nucleotide sequences described herein may be modified by any 
method available in the art. 

20 

It will be understood by a skilled person that numerous different nucleotide sequences 
can encode the same protein as a result of the degeneracy of the genetic code. In 
addition, it is to be understood that skilled persons may, using routine techniques, make 
nucleotide substitutions that do not substantially affect the activity encoded by the 

25 nucleotide sequence of the present invention to reflect the codon usage of any particular 
host organism in which the target is to be expressed. Thus, the terms "variant", 
"homologue" or "derivative" in relation to nucleotide sequences include any substitution 
of, variation of, modification of, replacement of, deletion of or addition of one (or more) 
nucleic acids from or to the sequence providing the resultant nucleotide sequence 

30 encodes a functional protein according to the present invention (or even a modulator of 
ACE according to the present invention if said modulator comprises a nucleotide 
sequence or an amino acid sequence). 
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AMINO ACID SEQUENCES 

As used herein, the term "amino acid sequence" is synonymous with the term 
"polypeptide" and/or the term "protein". In some instances, the term "amino acid 
5 sequence" is synonymous with the term "peptide". In some instances, the term 
"amino acid sequence" is synonymous with the term "protein". 

Aspects of the present invention concern the use of amino acid sequences, which may 
be available in databases. These amino acid sequences may comprise ACE proteins. 

10 

The amino acid sequence may be isolated from a suitable source, or it may be made 
synthetically or it may be prepared by use of recombinant DNA techniques. 

Preferably, ACE comprises SEQ ID No. 1 or SEQ ID No. 2, or a mutant, variant, 
15 . homologue, derivative or fragment thereof. More preferably, ACE comprises SEQ ID 
No. 2, or a mutant, variant, homologue, derivative or fragment thereof. 

PURITY 

20 Preferably the protein solution used for crystallisation is at least 97.5% pure. More 
preferably, the protein solution used for crystallisation is at least 99.0% pure. Most 
preferably, the protein solution used for crystallisation is at least 99.5% pure. 

MODEL 

25 

As used herein, the term "model" refers to a structural model such as a three 
dimensional (3D) structural model (or representation thereof) comprising ACE. 

Test compounds can be modelled that bind spatially and preferentially to ACE - such 
30 as to bind spatially and preferentially to ACE - for example, the active site of ACE. 

Preferably, the crystal model comprising ACE is built from all or a portion of the 
structure co-ordinates presented in Table A or Table B. 
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MUTANT 

As used herein, the term "mutant" refers to ACE comprising any one or more changes 
in the wild-type ACE sequence shown as SEQ ID No. 1 or one or more changes in the 
5 native ACE sequence shown as SEQ ID No. 2. 

The term "mutant" is not limited to any of the mutations described herein which are 
reflected in amino acid substitutions of the amino acid residues in ACE, but may also 
include, but are not limited to, other deletions or insertions of nucleotides which may 
10 result in changes in the amino acid residues in the amino acid sequence of ACE. 

In a preferred embodiment, mutations are located at amino acids 337 of SEQ ID No. 2 
or amino acids 90, 109, 155, 337 and 586 of SEQ ID No. 2. 

15 The present invention also enables the solving of the crystal structure of ACE mutants. 
More particularly, by virtue of the present invention, the location of the active site of 
ACE based on its crystal structure permits the identification of desirable sites for 
mutation. For example, one or more mutations may be directed to a particular site - such 
as the active site - or combination of sites of ACE. Similarly, only a location on, at or 

20 near the enzyme surface may be replaced, resulting in an altered surface charge of one or 
more charge units, as compared to the wild-type enzyme. Alternatively, an amino acid 
residue in ACE may be chosen for replacement based on its hydrophilic or hydrophobic 
characteristics. 

25 Such mutants may be characterised by any one of several different properties as 
compared with wild-type ACE. For example, such mutants may have altered surface 
charge of one or more charge units, or have an increased stability to subunit 
dissociation, or an altered substrate specificity in comparison with, or a higher 
specific activity than, wild-type ACE. 

30 

The mutants may be prepared in a number of ways that are known by a person skilled 
in the art. For example, mutations may be introduced by means of oligonucleotide- 
directed mutagenesis or other conventional methods. Alternatively, mutants of ACE 
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may be generated by site specific replacement of a particular amino acid with an 
unnaturally occurring amino acid. This may be achieved by growing a host organism 
capable of expressing either the wild-type or mutant polypeptide on a growth medium 
depleted of one or more natural amino acids, but enriched in one or more 
5 corresponding unnaturally occurring amino acids. 

The expression, activity (eg. kinetic constants) and/or the crystallisation properties of 
the mutants may be determined using the methods described herein. 

10 VARIANTS/HOMOLOGUES/DERIVATIVES/FRAGMENTS 

The ACE described herein is intended to include any polypeptide, which has the 
activity of the naturally occurring ACE and includes all vertebrate and mammalian 
forms. Such terms also include polypeptides that differ from naturally occurring 
15 forms of ACE by having amino acid deletions, substitutions, and additions, but which 
retain the activity of ACE. 

The term "variant' 9 is used to mean a naturally occurring polypeptide or nucleotide 
sequences which differs from a wild-type or a native sequence. 

20 

The term "fragment" indicates that a polypeptide or nucleotide sequence comprises a 
fraction of a wild-type or a native sequence. It may comprise one or more large 
contiguous sections of sequence or a plurality of small sections. The sequence may also 
comprise other elements of sequence, for example, it may be a fusion protein with 
25 another protein. Preferably the sequence comprises at least 50%, more preferably at 
least 65%, more preferably at least 80%, most preferably at least 90% of the wild-type 
sequence. 

The present invention also encompasses the use of variants, homologues and derivatives 
30 of nucleotide and amino acid sequences. Here, the term "homologue" means an entity 
having a certain homology with amino acid sequences or nucleotide sequences. Here, 
the term "homology" can be equated with "identity". 
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In the present context, an homologous sequence is taken to include an amino acid 
sequence which may be at least 75, 85 or 90% identical, preferably at least 95 or 98% 
identical to the subject sequence - such as ACE or a functional domain thereof. 

5 Although homology can also be considered in terms of similarity (i.e. amino acid 
residues having similar chemical properties/functions), it is preferred to express 
homology in terms of sequence identity. 

A significantly homologous amino acid sequence is taken to include an amino acid 
10 sequence which may be at least 75, 85 or 90% identical, preferably at least 95 or 98% 
identical to the subject sequence - such as ACE or a functional domain thereof. 

An homologous sequence is taken to include a nucleotide sequence which may be at 
least 75, 85 or 90% identical, preferably at least 95 or 98% identical to the subject 
15 sequence — such as ACE or a functional domain thereof. 

Homology comparisons can be conducted by eye, or more usually, with the aid of 
readily available sequence comparison programs. These commercially available 
computer programs can calculate % homology between two or more sequences. 

20 

% homology may be calculated over contiguous sequences, i.e. one sequence is 
aligned with the other sequence and each amino acid in one sequence is directly 
compared with the corresponding amino acid in the other sequence, one residue at a 
time. This is called an c6 ungapped" alignment. Typically, such ungapped alignments 
25 are performed only over a relatively short number of residues. 

Although this is a very simple and consistent method, it fails to take into 
consideration that, for example, in an otherwise identical pair of sequences, one 
insertion or deletion will cause the following amino acid residues to be put out of 
30 alignment, thus potentially resulting in a large reduction in % homology when a 
global alignment is performed. Consequently, most sequence comparison methods 
are designed to produce optimal alignments that take into consideration possible 
insertions and deletions without penalising unduly the overall homology score. This 
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is achieved by inserting "gaps" in the sequence alignment to try to maximise local 
homology. 

However, these more complex methods assign "gap penalties" to each gap that occurs 
5 in the alignment so that, for the same number of identical amino acids, a sequence 
alignment with as few gaps as possible - reflecting higher relatedness between the two 
compared sequences - will achieve a higher score than one with many gaps. "Affine 
gap costs" are typically used that charge a relatively high cost for the existence of a 
gap and a smaller penalty for each subsequent residue in the gap. This is the most 

10 commonly used gap scoring system. High gap penalties will of course produce 
optimised alignments with fewer gaps. Most alignment programs allow the gap 
penalties to be modified. However, it is preferred to use the default values when 
using such software for sequence comparisons. For example when using the GCG 
Wisconsin Bestfit package the default gap penalty for amino acid sequences is -12 for 

15 a gap and -4 for each extension. 

Calculation of maximum % homology therefore firstly requires the production of an 
optimal alignment, taking into consideration gap penalties. A suitable computer 
program for carrying out such an alignment is the GCG Wisconsin Bestfit package 

20 (University of Wisconsin, U.SA.; Devereux et al, 1984, Nucleic Acids Research 
12:387). Examples of other software than can perform sequence comparisons 
include, but are not limited to, the BLAST package (see Ausubel et al 9 1999 ibid - 
Chapter 18), FASTA (Atschul et aL, 1990, J. Mol. Biol., 403-410) and the 
GENEWORKS suite of comparison tools. Both BLAST and FASTA are available for 

25 offline and online searching (see Ausubel et al., 1999 ibid, pages 7-58 to 7-60). 
However, for some applications, it is preferred to use the GCG Bestfit program. A 
new tool, called BLAST 2 Sequences is also available for comparing protein and 
nucleotide sequence (see FEMS Microbiol Lett 1999 174(2): 247-50; FEMS 
Microbiol Lett 1999 177(1): 187-8) 

30 

Although the final % homology can be measured in terms of identity, the alignment 
process itself is typically not based on an all-or-nothing pair comparison. Instead, a 
scaled similarity score matrix is generally used that assigns scores to each pairwise 
comparison based on chemical similarity or evolutionary distance. An example of 
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such a matrix commonly used is the BLOSUM62 matrix - the default matrix for the 
BLAST suite of programs. GCG Wisconsin programs generally use either the public 
default values or a custom symbol comparison table if supplied (see user manual for 
further details). For some applications, it is preferred to use the public default values 
5 for the GCG package, or in the case of other software, the default matrix, such as 
BLOSUM62. 

Once the software has produced an optimal alignment, it is possible to calculate % 
homology, preferably % sequence identity. The software typically does this as part of 
10 the sequence comparison and generates a numerical result. 

By way of example, homologous sequences of ACE include, but are not limited to 
human ACEs somatic ACE (accession number: J04144), human ACEt testis ACE 
(accession number: M26657), human ACEH/ACE2 (accession numbers: AAF78220; 

15 BAB40370; AAF99721), chimp ACE T (accession number: AF193487_2), rabbit 
ACEt mature protein (accession number: P22968), rabbit ACEt full pre-protein 
(accession number: P22968), mouse ACEt testis ACE (accession number: P22967), 
bovine Cdom ACEs C-domain, rat Cdom ACEs C-domain (derived from accession 
number P47820; starting D616), human Ndom ACEs N-domain (derived from 

20 accession number PI 2821 (J04144)), chimp Ndom ACEs N-domain (derived from 
accession number AF193487_1), rabbit Ndom ACEs N-domain (derived from 
P12822), bovine Ndom (Bovine {Bos taurus} ACE S N-domain), mouse Ndom ACE S 
N-domain (derived from accession number P09470), rat Ndom ACEs N-domain 
(derived from accession number P47820), chick ACE (partial ACE accession number 

25 Q10751), dros AnCE (derived from accession number Q10714), dros ACEr (derived 
from accession number X96913), buffalo fly ACE (derived from accession number 
Q10715), and silkworm ACE (derived from accession number BAA97657), tick ACE 
(derived from accession number U62809). 

30 The sequences may also have deletions, insertions or substitutions of amino acid 
residues, which produce a silent change and result in a functionally equivalent 
substance. Deliberate amino acid substitutions may be made on the basis of similarity 
in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues as long as the secondary binding activity of the substance is 
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retained. For example, negatively charged amino acids include aspartic acid and 
glutamic acid; positively charged amino acids include lysine and arginine; and amino 
acids with uncharged polar head groups having similar hydrophilicity values include 
leucine, isoleucine, valine, glycine, alanine, asparagine, glutamine, serine, threonine, 
5 phenylalanine, and tyrosine. 

Conservative substitutions may be made, for example according to the Table below, 
Amino acids in the same block in the second column and preferably in the same line 
in the third column may be substituted for each other: 

10 



ALIPHATIC 


Non-polar 


GAP 1 






ILV 




Polar - uncharged 


CSTM 






NQ 




Polar - charged 


DE 






KR 


AROMATIC 




HFWY 



Homologous substitution (substitution and replacement are both used herein to mean 
the interchange of an existing amino acid residue, with an alternative residue) may 
occur i.e. like-for-like substitution such as basic for basic, acidic for acidic, polar for 
15 polar etc. Non-homologous substitution may also occur i.e. from one class of residue 
to another or alternatively involving the inclusion of unnatural amino acids such as 
ornithine (hereinafter referred to as Z), diaminobutyric acid ornithine (hereinafter 
referred to as B), norleucine ornithine (hereinafter referred to as O), pyriylalanine, 
thienylalanine, naphthylalanine and phenylglycine. 

20 

Replacements may also be made by unnatural amino acids include; alpha* and alpha- 
disubstituted* amino acids, N-alkyl amino acids*, lactic acid*, halide derivatives of 
natural amino acids such as trifluorotyrosine*, p-Cl-phenylalanine*, p-Br- 
phenylalanine*, p-I-phenylalanine*, L-allyl-glycine*, B-alanine*, L-a-amino butyric 
25 acid*, L-y-amino butyric acid*, L-a-amino isobutyric acid*, L-s-amino caproic acid*, 7- 
amino heptanoic acid*, ^methionine sulfone**, L-norleucine*, L-norvaline*, p-nitro-L- 
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phenylalanine*, L-hydroxyproline # , L-thioproline*, methyl derivatives of phenylalanine 
(Phe) such as 4-methyl-Phe*, pentamethyl-Phe*, L-Phe (4-amino) # , L-Tyr (methyl)*, L- 
Phe (4-isopropyl)*, L-Tic (l,2,3,4-tetrahydroisoqxiinoline-3-carboxyl acid)*, L- 
diaminopropionic acid # and L-Phe (4-benzyl)*. The notation * has been utilised for the 
5 purpose of the discussion above (relating to homologous or non-homologous 
substitution), to indicate the hydrophobic nature of the derivative whereas # has been 
utilised to indicate the hydrophilic nature of the derivative, #* indicates amphipathic 
characteristics. 

10 The term "derivative 11 or "derivatised" as used herein includes chemical modification of 
an entity - such as test compound or an ACE modulator. Illustrative of such chemical 
modifications would be replacement of hydrogen by a halo group, an alkyl group, an 
acyl group or an amino group. 

15 Variant amino acid sequences may include suitable spacer groups that may be inserted 
between any two amino acid residues of the sequence including alkyl groups such as 
methyl, ethyl or propyl groups in addition to amino acid spacers such as glycine or p- 
alanine residues. A further form of variation, involves the presence of one or more 
amino acid residues in peptoid form, will be well understood by those skilled in the 

20 art. For the avoidance of doubt, 'the peptoid form" is used to refer to variant amino 
acid residues wherein the a-carbon substituent group is on the residue's nitrogen atom 
rather than the a-carbon. Processes for preparing peptides in the peptoid form are 
known in the art, for example Simon RJ et al., PNAS (1992) 89(20), 9367-9371 and 
Horwell DC, Trends Biotechnol. (1995) 13(4), 132-134. 

25 

TEST COMPOUNDS 

As used herein, the term "test compound" includes, but is not limited to, a compound 
which may be obtainable from or produced by any suitable source, whether natural or 
30 not. 



The test compound may be designed or obtained from a library of compounds, which 
may comprise peptides, as well as other compounds, such as small organic molecules 
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and particularly new lead compounds. By way of example, the test compound may be 
a natural substance, a biological macromolecule, or an extract made from biological 
materials - such as bacteria, fungi, or animal (particularly mammalian) cells or tissues, 
an organic or an inorganic molecule, a synthetic test compound, a semi-synthetic test 
5 compound, a structural or functional mimetic, a peptide, a peptidomimetics, a 
derivatised test compound, a peptide cleaved from a whole protein, or a peptide 
synthesised synthetically (such as, by way of example, either using a peptide 
synthesiser or by recombinant techniques or combinations thereof, a recombinant test 
compound, a natural or a non-natural test compound, a fusion protein or equivalent 
10 thereof and mutants, derivatives or combinations thereof. The test compound may 
even be a compound that is a modulator of ACE - such as a known inhibitor of ACE - 
that has been modified in some way eg. by recombinant DNA techniques or chemical 
synthesis techniques. 

15 Typically, the test compound will be prepared by recombinant DNA techniques 
and/or chemical synthesis techniques. 

Once a test compound capable of interacting ACE has been identified, further steps 
may be carried out to select and/or to modify the test compounds and/or to modify 
20 existing compounds, such that they are able to modulate ACE. 

MODULATING ACE 

As herein, the term "modulating" refers to preventing, suppressing, inhibiting, 
25 alleviating, restorating, elevating, increasing or otherwise affecting ACE. 

The term "ACE modulator" may refer to a single entity or a combination of entities. 

The ACE modulator may be an antagonist or an agonist of ACE. 

30 

As used herein, the term "agonist" means any entity, which is capable of interacting 
(eg. binding) with ACE and which is capable of increasing a proportion of the ACE 
that is in an active form, resulting in an increased biological response. 
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As used herein, the term "antagonist 55 means any entity, which is capable of 
interacting (eg. binding) with ACE and which is capable of decreasing (eg. inhibiting) 
a proportion of the ACE that is in an active form, resulting in a decreased biological 
response. 

5 

Preferably, the ACE modulators of the present invention are antagonists of ACE. 

The modulator of ACE may be an organic compound or other chemical. The 
modulator of ACE may be a compound, which is obtainable from or produced by any 
10 suitable source, whether natural or artificial. The modulator of ACE may be an amino 
acid molecule, a polypeptide, or a chemical derivative thereof, or a combination 
thereof. The modulator of ACE may even be a polynucleotide molecule - which may 
be a sense or an anti-sense molecule. The modulator of ACE may even be an 
antibody. 

15 

The modulator of ACE may be designed or obtained from a library of compounds, 
which may comprise peptides, as well as other compounds, such as small organic 
molecules. 

20 By way of example, the modulator of ACE may be a natural substance, a biological 
macromolecule, or an extract made from biological materials such as bacteria, fungi, 
or animal (particularly mammalian) cells or tissues, an organic or an inorganic 
molecule, a synthetic agent, a semi-synthetic agent, a structural or functional mimetic, 
a peptide, a peptidomimetic, a derivatised agent, a peptide cleaved from a whole 

25 protein, or a peptide synthesised synthetically (such as, by way of example, either 
using a peptide synthesiser or by recombinant techniques or combinations thereof, a 
recombinant agent, an antibody, a natural or a non-natural agent, a fusion protein or 
equivalent thereof and mutants, derivatives or combinations thereof). 

30 Typically, the modulator of ACE will be an organic compound. Typically, the 
organic compounds will comprise two or more hydrocarbyl groups. Here, the term 
"hydrocarbyl group" means a group comprising at least C and H and may optionally 
comprise one or more other suitable substituents. Examples of such substituents may 
include halo-, alkoxy-, nitro-, an alkyl group, a cyclic group etc. In addition to the 
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possibility of the substituents being a cyclic group, a combination of substituents may 
form a cyclic group. If the hydrocarbyl group comprises more than one C then those 
carbons need not necessarily be linked to each other. For example, at least two of the 
carbons may be linked via a suitable element or group. Thus, the hydrocarbyl group 

5 may contain hetero atoms. Suitable hetero atoms will be apparent to those skilled in 
the art and include, for instance, sulphur, nitrogen and oxygen. For some 
applications, preferably the modulator of ACE comprises at least one cyclic group. 
The cyclic group may be a polycyclic group, such as a non-fused polycyclic group. 
For some applications, the modulator of ACE comprises at least the one of said cyclic 

10 groups linked to another hydrocarbyl group. 

The modulator of ACE may contain halo groups, for example, fluoro, chloro, bromo or 
iodo groups. , 

15 The modulator of ACE may contain one or more of alkyl, alkoxy, alkenyl, alkylene 
and alkenylene groups — which may be unbranched- or branched-chain. 

The modulator of ACE may be in the form of a pharmaceutically acceptable salt — 
such as an acid addition salt or a base salt - or a solvate thereof, including a hydrate 
20 thereof. For a review on suitable salts see Berge et al, (1977) J. Pharm. ScL 66, 1-19. 

The modulator of ACE may be structurally novel modulators of ACE. 

The modulators of ACE may be analogues of known modulators of ACE — such as 
25 known inhibitors of ACE (for example, snake venom, peptides produced by 
enzymatic hydrolysis of casein or fish meat protein, or Benazepril, Captopril, 
Cilazapril, Enalapril, Fosinopril, Lisinopril, Moexipril, Perindopril, Quinapril, 
Ramipril, Trandolapril and Enalaprilat. 

30 Preferably, the ACE modulators have improved properties over those previously 
available, for example, fewer side effects - such as cough (eg. dry, persistent); fever and 
chills; hoarseness; swelling of face, mouth, hands, or feet; trouble in swallowing or 
breathing; itching of skin; yellow eyes or skin; dizziness, light-headedness, or fainting; 
skin rash, with or without itching; fever, or joint pain; abdominal pain, abdominal 
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distention; nausea, or vomiting; chest pain, confusion; irregular heartbeat; nervousness; 
numbness or tingling in hands, feet, or lips; weakness or heaviness of legs; headache, 
diarrhoea; loss of taste; nausea; unusual tiredness. 

5 The modulator of ACE may be a mimetic. 

The modulator of ACE may also be chemically modified. 

The modulator of ACE may be capable of displaying other therapeutic properties. 

10 

The modulator of ACE may be used in combination with one or more other 
pharmaceutically active agents. 

If combinations of active agents are administered, then they may be administered 
15 simultaneously, separately or sequentially. 

MIMETIC 

As used herein, the term "mimetic" relates to any chemical which includes, but is not 
20 limited to, a peptide, polypeptide, antibody or other organic chemical which has the 
same qualitative activity or effect as a known compound. That is, the mimetic is a 
functional equivalent of a known compound. 

STEREO AND GEOMETRIC ISOMERS 

25 

Modulators of ACE may exist as stereoisomers and/or geometric isomers - e.g. they 
may possess one or more asymmetric and/or geometric centres and so may exist in 
two or more stereoisomeric and/or geometric forms. The present invention 
contemplates the use of all the individual stereoisomers and geometric isomers, and 
30 mixtures thereof. 



PHARMACEUTICAL SALT 
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Modulators of ACE may be administered in the form of a pharmaceutically acceptable 
salt. 

Pharmaceutically-acceptable salts are well known to those skilled in the art, and for 
example include those mentioned by Berge et al, (1977) J. Pharm. Sci., 66, 1-19. 
Suitable acid addition salts are formed from acids which form non-toxic salts and 
include the hydrochloride, hydrobromide, hydroiodide, nitrate, sulphate, bisulphate, 
phosphate, hydrogenphosphate, acetate, trifluoroacetate, gluconate, lactate, salicylate, 
citrate, tartrate, ascorbate, succinate, maleate, fumarate, gluconate, formate, benzoate, 
methanesulphonate, ethanesulphonate, benzenesulphonate and p-toluenesulphonate 
salts. 

When one or more acidic moieties are present, suitable pharmaceutically acceptable 
base addition salts can be formed from bases which form non-toxic salts and include 
the aluminium, calcium, lithium, magnesium, potassium, sodium, zinc, and 
pharmaceutically-active amines such as diethanolamine, salts. 

A pharmaceutically acceptable salt of a modulator of ACE may be readily prepared 
by mixing together solutions of the modulator of ACE and the desired acid or base, as 
appropriate. The salt may precipitate from solution and be collected by filtration or 
may be recovered by evaporation of the solvent. 

The modulator of ACE may exist in polymorphic form. 

The modulator of ACE may contain one or more asymmetric carbon atoms and 
therefore exists in two or more stereoisomeric forms. Where a modulator of ACE 
contains an alkenyl or alkenylene group, cis (E) and trans (Z) isomerism may also 
occur. The present invention includes the individual stereoisomers of the modulator 
of ACE and, where appropriate, the individual tautomeric forms thereof, together with 
mixtures thereof. 

Separation of diastereoisomers or cis and trans isomers may be achieved by 
conventional techniques, e.g. by fractional crystallisation, chromatography or 
H.P.L.C. of a stereoisomeric mixture of the modulator of ACE or a suitable salt or 
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derivative thereof. An individual enantiomer of the modulator of ACE may also be 
prepared from a corresponding optically pure intermediate or by resolution, such as 
by H.P.L.C. of the corresponding racemate using a suitable chiral support or by 
fractional crystallisation of the diastereoisomeric salts formed by reaction of the 
5 corresponding racemate with a suitable optically active acid or base, as appropriate. 

The modulator of ACE may also include all suitable isotopic variations of the 
modulator of ACE or a pharmaceutically acceptable salt thereof. An isotopic 
variation of an modulator of ACE or a pharmaceutically acceptable salt thereof is 

10 defined as one in which at least one atom is replaced by an atom having the same 
atomic number but an atomic mass different from the atomic mass usually found in 
nature. Examples of isotopes that can be incorporated into the modulator of ACE and 
pharmaceutically acceptable salts thereof include isotopes of hydrogen, carbon, 
nitrogen, oxygen, phosphorus, sulphur, fluorine and chlorine such as 2 H, 3 H, 13 C, 14 C, 

15 15 N, n O, 18 0, 31 P, 32 P, 35 S, 18 F and 36 C1, respectively. Certain isotopic variations of 
the modulator of ACE and pharmaceutically acceptable salts thereof, for example, 
those in which a radioactive isotope such as 3 H or 14 C is incorporated, are useful in 
drug and/or substrate tissue distribution studies. Tritiated, i.e., 3 H, and carbon- 14, i.e., 
C, isotopes are particularly preferred for their ease of preparation and detectability. 

20 Further, substitution with isotopes such as deuterium, i.e., 2 H, may afford certain 
therapeutic advantages resulting from greater metabolic stability, for example, 
increased in vivo half-life or reduced dosage requirements and hence may be preferred 
in some circumstances. Isotopic variations of the modulator of ACE and 
pharmaceutically acceptable salts thereof of this invention can generally be prepared 

25 by conventional procedures using appropriate isotopic variations of suitable reagents. 

It will be appreciated by those skilled in the art that the agent may be derived from a 
prodrug. Examples of prodrugs include entities that have certain protected group(s) 
and which may not possess pharmacological activity as such, but may, in certain 
30 instances, be administered (such as orally or parenterally) and thereafter metabolised 
in the body to form the modulator of ACE which is pharmacologically active. 

It will be further appreciated that certain moieties known as "pro-moieties", for 
example as described in "Design of Prodrugs" by H. Bundgaard, Elsevier, 1985 (the 
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disclosured of which is hereby incorporated by reference), may be placed on 
appropriate functionalities of the modulator of ACE. Such prodrugs are also included 
within the scope of the invention. 

PHARMACEUTICAL!, Y ACTIVE SALT 

The modulator of ACE may be administered as a phannaceutically acceptable salt. 
Typically, a phannaceutically acceptable salt may be readily prepared by using a 
desired acid or base, as appropriate. The salt may precipitate from solution and be 
collected by filtration or may be recovered by evaporation of the solvent. 

CHEMICAL SYNTHESIS METHODS 

The modulator of ACE of the present invention may be prepared by chemical 
synthesis techniques. 

It will be apparent to those skilled in the art that sensitive functional groups may need to 
be protected and deprotected during synthesis of a compound of the invention. This may 
be achieved by conventional techniques, for example as described in "Protective Groups 
in Organic Synthesis" by T W Greene and PGM Wuts, John Wiley and Sons Inc. 
(1991), and by P.J.Kocienski, in "Protecting Groups", Georg Thieme Verlag (1994). 

It is possible during some of the reactions that any stereocentres present could, under 
certain conditions, be racemised, for example if a base is used in a reaction with a 
substrate having an having an optical centre comprising a base-sensitive group. This is 
possible during e.g. a guanylation step. It should be possible to circumvent potential 
problems such as this by choice of reaction sequence, conditions, reagents, 
protection/deprotection regimes, etc. as is well-known in the art. 

The compounds and salts may be separated and purified by conventional methods. 

Separation of diastereomers may be achieved by conventional techniques, e.g. by 
fractional crystallisation, chromatography or H.P.L.C. of a stereoisomeric mixture of 
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a compound of formula (I) or a suitable salt or derivative thereof. An individual 
enantiomer of a compound of formula (I) may also be prepared from a corresponding 
optically pure intermediate or by resolution, such as by H.P.L.C. of the corresponding 
racemate using a suitable chiral support or by fractional crystallisation of the 
5 diastereomeric salts formed by reaction of the corresponding racemate with a suitably 
optically active acid or base. 

ACE, modulators of ACE or variants, homologies, derivatives, fragments or mimetics 
thereof may be produced using chemical methods to synthesise the ACE or the 

10 modulator of ACE in whole or in part. For example, a ACE peptide or a modulator of 
ACE that is a peptide can be synthesised by solid phase techniques, cleaved from the 
resin, and purified by preparative high performance liquid chromatography (e.g., 
Creighton (1983) Proteins Structures And Molecular Principles, WH Freeman and Co, 
New York NY). The composition of the synthetic peptides may be confirmed by 

15 amino acid analysis or sequencing (e.g., the Edman degradation procedure; Creighton, 
supra). 

Synthesis of peptides (or variants, homologues, derivatives, fragments or mimetics 
thereof) may be performed using various solid-phase techniques (Roberge JY et al 

20 (1995) Science 269: 202-204) and automated synthesis may be achieved, for example, 
using the ABI 43 1 A Peptide Synthesizer (Perkin Elmer) in accordance with the 
instructions provided by the manufacturer. Additionally, the amino acid sequences 
comprising the modulator of ACE, may be altered during direct synthesis and/or 
combined using chemical methods with a sequence from other subunits, or any part 

25 thereof, to produce a variant modulator of ACE. 

CHEMICAL MODIFICATION 

The modulator of ACE may be a chemically modified modulator of ACE. 

30 

The chemical modification of a modulator of ACE may either enhance or reduce 
interactions between the modulator of ACE and the target - such as hydrogen bonding 
interactions, charge interactions, hydrophobic interactions, van der Waals interactions 
or dipole interactions. 
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In one aspect, the modulator of ACE may act as a model (for example, a template) for 
the development of other compounds. 

5 PHARMACEUTICAL COMPOSITIONS 

The components may be administered alone but will generally be administered as a 
pharmaceutical composition — e.g. when the components are in a mixture with a 
suitable pharmaceutical excipient, diluent or carrier selected with regard to the 
10 intended route of administration and standard pharmaceutical practice. 

For example, the components can be administered in the form of tablets, capsules, 
ovules, elixirs, solutions or suspensions, which may contain flavouring or colouring 
agents, for immediate-, delayed-, modified-, sustained-, pulsed- or controlled-release 
15 applications. 

If the pharmaceutical is a tablet, then the tablet may contain excipients such as 
microcrystalline cellulose, lactose, sodium citrate, calcium carbonate, dibasic calcium 
phosphate and glycine, disintegrants such as starch (preferably com, potato or tapioca 
20 starch), sodium starch glycollate, croscarmellose sodium and certain complex 
silicates, and granulation binders - such as polyvinylpyrrolidone, 
hydroxypropylmethylcellulose (HPMC), hydroxypropylcellulose (HPC), sucrose, 
gelatin and acacia. Additionally, lubricating agents - such as magnesium stearate, 
stearic acid, glyceryl behenate and talc may be included. 

25 

Solid compositions of a similar type may also be employed as fillers in gelatin 
capsules. Preferred excipients in this regard include lactose, starch, a cellulose, milk 
sugar or high molecular weight polyethylene glycols. For aqueous suspensions and/or 
elixirs, the modulator of ACE may be combined with various sweetening or 
30 flavouring agents, colouring matter or dyes, with emulsifying and/or suspending 
agents and with diluents such as water, ethanol, propylene glycol and glycerin, and 
combinations thereof. 
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The routes for administration (delivery) may include, but are not limited to, one or 
more of oral (e.g. as a tablet, capsule, or as an ingestable solution), topical, mucosal 
(e.g. as a nasal spray or aerosol for inhalation), nasal, parenteral (e.g. by an injectable 
form), gastrointestinal, intraspinal, intraperitoneal, intramuscular, intravenous, 
5 intraventricular, intrauterine, intraocular, intradermal, intracranial, intratracheal, 
intravaginal, intracerebroventricular, intracerebral, subcutaneous, ophthalmic 
(including intravitreal or intracameral), transdermal, rectal, buccal, vaginal, epidural, 
sublingual. 

10 Pharmaceutical compositions of the present invention may comprise a therapeutically 
effective amount of ACE, one or more modulators of ACE or combinations thereof. 

The pharmaceutical compositions may be for human or animal usage in human and 
veterinary medicine and will typically comprise any one or more of a 

15 pharmaceutically acceptable diluent, carrier, or excipient. Acceptable carriers or 
diluents for therapeutic use are well known in the pharmaceutical art, and are 
described, for example, in Remington's Pharmaceutical Sciences, Mack Publishing 
Co. (A. R. Gennaro edit. 1985). The choice of pharmaceutical carrier, excipient or 
diluent can be selected with regard to the intended route of administration and 

20 standard pharmaceutical practice. The pharmaceutical compositions may comprise as 
- or in addition to - the carrier, excipient or diluent any suitable binder(s), 
lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s). 

Preservatives, stabilizers, dyes and even flavouring agents may be provided in the 
25 pharmaceutical composition. Examples of preservatives include sodium benzoate, 
sorbic acid and esters of p-hydroxybenzoic acid. Antioxidants and suspending agents 
may be also used. 

There may be different composition/formulation requirements dependent on the 
30 different delivery systems. By way of example, the pharmaceutical composition of 
the present invention may be formulated to be administered using a mini-pump or by 
a mucosal route, for example, as a nasal spray or aerosol for inhalation or ingestable 
solution, or parenterally in which the composition is formulated by an injectable form, 
for delivery, by, for example, an intravenous, intramuscular or subcutaneous route. 
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Alternatively, the formulation may be designed to be administered by a number of 
routes. 

If the modulator of ACE is to be administered mucosally through the gastrointestinal 
5 mucosa, it should be able to remain stable during transit though the gastrointestinal 
tract; for example, it should be resistant to proteolytic degradation, stable at acid pH 
and resistant to the detergent effects of bile. 

Where appropriate, the pharmaceutical compositions may be administered by 
10 inhalation, in the form of a suppository or pessary, topically in the form of a lotion, 
solution, cream, ointment or dusting powder, by use of a skin patch, orally in the form 
of tablets containing excipients such as starch or lactose, or in capsules or ovules 
either alone or in admixture with excipients, or in the form of elixirs, solutions or 
suspensions containing flavouring or colouring agents, or the pharmaceutical 
15 compositions can be injected parenterally, for example intravenously, intramuscularly 
or subcutaneously. For parenteral administration, the compositions may be best used 
in the form of a sterile aqueous solution which may contain other substances, for 
example enough salts or monosaccharides to make the solution isotonic with blood 
For buccal or sublingual administration the compositions may be administered in the 
20 form of tablets or lozenges which can be formulated in a conventional manner. 

The modulators of ACE may be used in combination with a cyclodextrin. 
Cyclodextrins are known to form inclusion and non-inclusion complexes with drug 
molecules. Formation of a drug-cyclodextrin complex may modify the solubility, 

25 dissolution rate, bioavailability and/or stability property of a drug molecule. Drug- 
cyclodextrin complexes are generally useful for most dosage forms and administration 
routes. As an alternative to direct complexation with the drug the cyclodextrin may 
be used as an auxiliary additive, e.g. as a carrier, diluent or solubiliser. Alpha-, beta- 
and gamma-cyclodextrins are most commonly used and suitable examples are 

30 described in WO-A-91/1 1 172, WO-A-94/02518 and WO-A-98/55148. 

If the modulator of ACE is a protein, then said protein may be prepared in situ in the 
subject being treated. In this respect, nucleotide sequences encoding said protein may 
be delivered by use of non-viral techniques (e.g. by use of liposomes) and/or viral 
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techniques (e.g. by use of retroviral vectors) such that the said protein is expressed 
from said nucleotide sequence. 

DOSE LEVELS 

Typically, a physician will determine the actual dosage which will be most suitable 
for an individual subject. The specific dose level and frequency of dosage for any 
particular patient may be varied and will depend upon a variety of factors including 
the activity of the specific compound employed, the metabolic stability and length of 
action of that compound, the age, body weight, general health, sex, diet, mode and 
time of administration, rate of excretion, drug combination, the severity of the 
particular condition, and the individual undergoing therapy. 

FORMULATION 

The components) may be formulated into a pharmaceutical composition, such as by 
mixing with one or more of a suitable carrier, diluent or excipient, by using 
techniques that are known in the art. 

VECTOR 

Aspects of the present invention relate to a vector comprising a nucleotide sequence - 
such as a nucleotide sequence encoding ACE or a modulator of ACE - administered to 
a subject. 

Preferably, ACE or the modulator of ACE is prepared and/or delivered using a genetic 
vector. 

As it is well known in the art, a vector is a tool that allows or facilitates the transfer of 
an entity from one environment to another. In accordance with the present invention, 
and by way of example, some vectors used in recombinant DNA techniques allow 
entities, such as a segment of DNA (such as a heterologous DNA segment, such as a 
heterologous cDNA segment), to be transferred into a host and/or a target cell for the 
purpose of replicating the vectors comprising nucleotide sequences and/or expressing 
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the proteins encoded by the nucleotide sequences. Examples of vectors used in 
recombinant DNA techniques include, but are not limited to, plasmids, chromosomes, 
artificial chromosomes or viruses. 

5 The term "vector" includes expression vectors and/or transformation vectors. 

The term "expression vector" means a construct capable of in vivo or in vitro/ex vivo 
expression. 

10 The term "transformation vector" means a construct capable of being transferred from 
one species to another. 

REGULATORY SEQUENCES 

15 In some applications, nucleotide sequences are operably linked to a regulatory 
sequence which is capable of providing for the expression of the nucleotide sequence, 
such as by a chosen host cell. By way of example, a vector comprising the ACE 
nucleotide sequence is operably linked to such a regulatory sequence i.e. the vector is 
an expression vector. 

20 

The term "operably linked" refers to a juxtaposition wherein the components 
described are in a relationship permitting them to function in their intended manner. 
A regulatory sequence "operably linked" to a coding sequence is ligated in such a way 
that expression of the coding sequence is achieved under conditions compatible with 
25 the control sequences. 

The term "regulatory sequences" includes promoters and enhancers and other 
expression regulation signals. 

30 The term "promoter" is used in the normal sense of the art, e.g. an RNA polymerase 
binding site. 

Enhanced expression of a nucleotide sequence, for example, a nucleotide sequence 
encoding ACE - may also be achieved by the selection of heterologous regulatory 
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regions, e.g. promoter, secretion leader and terminator regions, which serve to 
increase expression and, if desired, secretion levels of the protein of interest from the 
chosen expression host and/or to provide for the inducible control of the expression of 
ACE. In eukaryotes, polyadenylation sequences may be operably connected to the 
5 ACE nucleotide sequence. 

Preferably, the ACE nucleotide sequence is operably linked to at least a promoter. 

Aside from the promoter native to the gene encoding the ACE nucleotide sequence, 
10 other promoters may be used to direct expression of the ACE polypeptide. The 
promoter may be selected for its efficiency in directing the expression of the ACE 
nucleotide sequence in the desired expression host. 

In another embodiment, a constitutive promoter may be selected to direct the 
15 expression of the ACE nucleotide sequence of the present invention. Such an 
expression construct may provide additional advantages since it circumvents the need 
to culture the expression hosts on a medium containing an inducing substrate. 

Hybrid promoters may also be used to improve inducible regulation of the expression 
20 construct. 

The promoter can additionally include features to ensure or to increase expression in a 
suitable host. For example, the features can be conserved regions such as a Pribnow 
Box or a TATA box. The promoter may even contain other sequences to affect (such 
25 as to maintain, enhance, decrease) the levels of expression of the ACE nucleotide 
sequence. For example, suitable other sequences include the Shl-intron or an ADH 
intron. Other sequences include inducible elements - such as temperature, chemical, 
light or stress inducible elements. Also, suitable elements to enhance transcription or 
translation may be present. 

30 

EXPRESSION VECTOR 



Preferably, nucleotide sequences - such as nucleotide sequences encoding ACE or 
modulators of ACE - are inserted into a vector that is operably linked to a control 
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sequence that is capable of providing for the expression of the coding sequence by the 
host cell. 

Nucleotide sequences produced by a host recombinant cell may be secreted or may be 
5 contained intracellular^ depending on the sequence and/or the vector used. As will 
be understood by those of skill in the art, expression vectors can be designed with 
signal sequences, which direct secretion of the nucleotide sequence through a 
particular prokaryotic or eukaryotic cell membrane. 

10 The expression vector may be pEE-tACEA36NJ which encodes human tACE that 
lacks the heavily 0-glycosylated, 36-residue N-terminal sequence and is truncated after 
Ser , thereby lacking most of the juxtamembrane stalk as well as the transmembrane 
and cytoplasmic domains. 

15 In a preferred aspect, the expression vector is pLEN- tACEA36g(n), where n is a set of 
numbers defining the available iV-linked glycosylation sites. 

Preferably, the expression vectors are stably expressed in CHO cells as described 
previously (Ehlers et ah (1996) Biochemistry 35, 9549-9559). More preferably, the 
20 expression vectors are pLEN- tACEA36g(l , 2, 3, 4) and pLEN- tACEA36g(l ,3). 

FUSION PROTEINS 

ACE or a modulator of ACE may be expressed as a fusion protein to aid extraction 
25 and purification and/or delivery of the modulator of ACE or the ACE protein to an 
individual and/or to facilitate the development of a screen for modulators of ACE. 

Examples of fusion protein partners include glutathione-S-transferase (GST), 6xHis, 
GAL4 (DNA binding and/or transcriptional activation domains) and (5-galactosidase. 

30 

It may also be convenient to include a proteolytic cleavage site between the fusion 
protein partner and the protein sequence of interest to allow removal of fusion protein 
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sequences. Preferably, the fusion protein will not hinder the activity of the protein of 
interest. 

The fusion protein may comprise an antigen or an antigenic determinant fused to the 
substance of the present invention. In this embodiment, the fusion protein may be a 
non-naturally occurring fusion protein comprising a substance, which may act as an 
adjuvant in the sense of providing a generalised stimulation of the immune system. 
The antigen or antigenic determinant may be attached to either the amino or carboxy 
terminus of the substance. 

ORGANISM 

The term "organism" in relation to the present invention includes any organism that 
could comprise ACE and/or modulators of ACE. Examples of organisms may include 
mammals, fungi, yeast or plants. 

Preferably, the organism is a mammal. More preferably, the organism is a human. 
TRANSFORMATION 

As indicated earlier, the host organism can be a prokaryotic or a eukaryotic organism. 
Examples of suitable prokaryotic hosts include E. colt and Bacillus subtilis. Teachings 
on the transformation of prokaryotic hosts are well documented in the art, for example 
see Sambrook et al (Molecular Cloning: A Laboratory Manual, 2nd edition, 1989, Cold 
Spring Harbor Laboratory Press) and Ausubel et al 9 Current Protocols in Molecular 
Biology (1995), John Wiley & Sons, Inc. Examples of suitable eukaryotic hosts 
include mammalian cells. 

If a prokaryotic host is used then the nucleotide sequence - such as the ACE nucleotide 
sequence - may need to be suitably modified before transformation - such as by removal 
ofintrons. 

Thus, the present invention also relates to the transformation of a host cell with a 
nucleotide sequence - such as ACE or a modulator of ACE. Host cells transformed. 
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with the nucleotide sequence may be cultured under conditions suitable for the 
expression and recovery of the encoded protein from cell culture. The protein 
produced by a recombinant cell may be secreted or may be contained intracellularly 
depending on the sequence and/or the vector used. As will be understood by those of 
5 skill in the art, expression vectors containing coding sequences can be designed with 
signal sequences which direct secretion of the coding sequences through a particular 
prokaryotic or eukaryotic cell membrane. Other recombinant constructions may join 
the coding sequence to nucleotide sequence encoding a polypeptide domain, which 
will facilitate purification of soluble proteins (Kroll DJ et al (1993) DNA Cell Biol 
10 12:441-53) e.g. 6-His or Glutathiones-transferase; 

TRANSFECTION 

Vectors comprising for example, the ACE nucleotide sequence, may be introduced 
15 into host cells, for example, mammalian cells, using a variety of methods. 

Typical transfection methods include electroporation, DNA biolistics, lipid-mediated 
transfection, compacted DNA-mediated transfection, liposomes, immunoliposomes, 
lipofectin, cationic agent-mediated, cationic facial amphiphiles (CFAs) (Nature 
20 Biotech. (1996) 14, 556), multivalent cations such as spermine, cationic lipids or 
polylysine, 1, 2,-bis (oleoyloxy)-3-(trimethylarnmonio) propane (DOTAP)-cholesterol 
complexes (Wolff and Trubetskoy 1998 Nature Biotechnology 16: 421) and 
combinations thereof. 

25 Uptake of nucleic acid constructs by mammalian cells is enhanced by several known 
transfection techniques for example those including the use of transfection agents. 
Example of these agents include cationic agents (for example calcium phosphate and 
DEAE-dextran) and lipofectants (for example lipofectam™ and transfectam™). 
Typically, nucleic acid constructs are mixed with the transfection agent to produce a 

30 composition. 

Such methods are described in many standard laboratory manuals - such as Sambrook 
et a!., Molecular Cloning: A Laboratory Manual, 2d ed. (1989) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, N. Y. 
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GENERAL RECO MBINANT DNA METHODOLOGY TECHNIQUES 

The present invention employs, unless otherwise indicated, conventional techniques of 
5 chemistry, molecular biology, microbiology, recombinant DNA and immunology, which 
are within the capabilities of a person of ordinary skill in the art. Such techniques are 
explained in the literature. See, for example, J. Sambrook, E. F. Fritsch, and T. Maniatis, 
1989, Molecular Cloning: A Laboratory Manual, Second Edition, Books 1-3, Cold 
Spring Harbor Laboratory Press; AusubeL F. M. et al. (1995 and periodic supplements; 

10 Current Protocols in Molecular Biology, ch. 9, 13, and 16, John Wiley & Sons, New 
York, N.Y.); B. Roe, J. Crabtree, and A. Kahn, 1996, DNA Isolation and Sequencing: 
Essential Techniques, John Wiley & Sons; J. M. Polak and James O'D. McGee, 1990, In 
Situ Hybridization: Principles and Practice; Oxford University Press; M. J. Gait 
(Editor), 1984, Oligonucleotide Synthesis: A Practical Approach, Lrl Press; and, D. M. J. 

15 Lilley and J. E. Dahlberg, 1992, Methods of Enzymology: DNA Structure Part A: 
Synthesis and Physical Analysis of DNA Methods in Enzymology, Academic Press. Each 
of these general texts is herein incorporated by reference. 

DESCRIPTION OF THE DRAWINGS 

20 

Figure 1 is a schematic presentation of wild-type tACE (tACE) and tACEA36 
glycosylation mutants. Glycosylation sites are shown with boxes, glycosylated 
always (black box), glycosylated partially (grey box) and unglycosylated (open box). 
The O-glycosylated region at the N-terminus of (tACE), which is absent in tACEA36, 
25 is shown with a stippled box. TM is the transmembrane domain. 

Figure 2 illustrates the expression of tACE glycosylation mutants. Proteins are 
immunodetected from detergent-solubilised cells (A) and from harvested medium (B) 
with rabbit anti-human tACE antibody (at 1:2000). The estimated protein size is 
30 indicated. Lanes contain tACEA36 (A36), untreated CHO cells (CHO) and ACE 
glycosylation mutants tACEA36-g(n) where n equals the number of sites that are 
glycosylated. 
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Figures 3 illustrates the effect of tunicamycin on the expression and processing of 
tACE. CHO cells expressing wild-type tACE are treated with the glycosylation 
inhibitor tunicamycin for up to 24 h. Cell lysate from untreated (a) and treated (b) 
cells are analysed at the indicated times by Western blotting, protein molecular 
5 weights are given. 

Figure 4 illustrates the effect of phorbol ester on the levels of ACE activity. Results 
are expressed as percentage soluble of total (soluble plus cell associated) ACE 
activity. Lanes show wild-type tACE (tACE), tACEA36 (A36) and glycosylation 
10 mutants tACEA36 g(n), where n gives the number of sites that are glycosylated. CHO 
cells were grown either in the absence (open bars) or the presence (filled bars) of 
phorbol 12, 13 ^dibutyrate. 

Figure 5 represents the orthorhombic crystals of tACEA36. 

15 

Figure 6 schematically illustrates a general purpose computer (132) of the type that may 
be used in accordance with the present invention. The computer (132) includes a central 
processing unit (134), a read only memory (136), a random access memory (138), a hard 
disk drive (140), a display driver (142) and display (144) and a user input/output circuit 

20 (146) with a keyboard (148) and mouse (150) all connected via a common bus (152). 
The central processing unit (134) may execute program instructions stored within the 
ROM (136), the RAM (138) or the hard disk drive (140) to carry out processing of signal 
values that may be stored within the RAM (138) or the hard disk drive (140). The 
program may be written in a wide variety of different programming languages. The 

25 computer program itself may be stored and distributed on a recording medium, such as a 
compact disc, or may be downloaded over a network link (not illustrated). The general 
purpose computer (132) when operating under control of an appropriate computer 
program effectively forms an apparatus for performing aspects of the present invention. 

30 Figure 7 illustrates the overview of the testis ACE structure, (a) Ribbon diagram of 
the molecule looking down on the active site. The molecule can be divided into 2 
halves as domains I and II shown in cyan and pink respectively. The active site zinc 
ion and the lisinopril molecule are shown in yellow. The two chloride ions are shown 
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as black spheres, (b) The molecular surface representation showing the active site 
groove, (c) The structure-sequence relationship in ACE. The secondary structure 
elements (domain I-cyan; domain II-pink) follow the same colour code as in (a). The 
important residues which are involved in binding are marked- zinc ligands (yellow), 
5 chloride binding residues (Cll- light green, C12- dark green), lisinopril binding 
residues (cyan) and glycosylation sites (open boxes). 

Figure 8 illustrates details of the active site, (a) Zinc co-ordination in the native 
structure. Bound zinc ion, acetate ion (AC) and part of the butyl succinate moiety 
10 (LIG) are shown, (b) Details of lisinopril (LIS) interactions. Bound zinc ion is also 
shown. 

Figure 9 illustrates the comparison of (a) ACE and (b) Neurolysin folds. The same 
view as in Figure 7a has been retained. 

15 

The invention will now be further described by way of Examples, which are meant to 
serve to assist one of ordinary skill in the art in carrying out the invention and are not 
intended in any way to limit the scope of the invention. 

20 EXAMPLES 

Example 1 

Materials & Methods 

25 

Materials 

Endoproteinase Lys-C and Asp-N, peptide-JV-glycosidase F (PNGase F), 
endoglycosidase H (endo H), neuraminidase, and O-glycosidase are purchased from 
Roche Biochemicals. Cyanogen bromide, trifluoroacetic acid, and calibration 
30 standards (angiotensin, insulin, myoglobin, oxidised insulin B-chain, and TPCK- 
treated trypsin) are from Sigma Chemical Co. Glycosylation inhibitor N- 
butyldeoxynojirimycin (NB-DNJ) was a kind gift from Dr. F. Piatt, University of 
Oxford, UK. 
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Construction of Expression Vectors 

pEE-ACEA36NJ encodes human tACE that lacks the heavily 0-glycosylated, 36- 
residue N-terminal sequence and is truncated after Ser 625 , thereby lacking most of the 
juxtamembrane stalk as well as the transmembrane and cytoplasmic domains, and is 

5 constructed as follows. The 5' half of the ACE cDNA in the plasmid pLEN-ACE- 
JMA24 is excised by digestion with BamHI and Nhel and replaced with the similarly 
digested fragment from plasmid pLEN-ACEA36N. pLEN-ACE-JMA24 has an 
engineered EcoRI site at nucleotide (nt) 1984 in the ACE cDNA. The sequence 
between nt 1854 (the start of the unique Bell site) and nt 1990 (the end of the codon 

10 for Ser 625 ) in the native ACE cDNA are amplified by the polymerase chain reaction, 
using a 3 ! primer that contains two stop codons (TAA and TAG) after the Ser 625 
codon, followed by an EcoRI site. The recombinant sequence is inserted into the 
pLEN-ACE_36N/JMA24 hybrid cut with Bell and EcoRI, to generate pLEN- 
ACEA36NJ. The ACEA36NJ coding sequence is excised by digestion of unique Xbal 

15 (generated after first subcloning in pBluescript) and EcoRI sites and inserted into the 
polylinker of the expression vector pEE14, to generate pEE-ACEA36NJ. 

Cell Culture and Transfections 

CHO-K1 cells stably transfected with pLEN-ACE glycosylation mutants are grown 
20 and maintained in standard media (50% Ham's F-12/50% DME medium 
supplemented with 20 mM Hepes, pH 7.3) containing 2% fetal bovine serum (heated 
to 65 °C for 15 mins before use) and 40 pM Zn Cl 2 . In addition, native CHO-K1 cells 
are cotransfected with pEE-ACEA36NJ (10 jig) and pSV2NEO (1 ng) by the calcium 
phosphate precipitate method and clones stably resistant to G418 (Geneticin, Gibco- 
25 BRL) are selected and assayed for ACE activity, by procedures detailed previously 
(9,1 1). Clones stably expressing pEE-ACEA36NJ are further selected for resistance to 
methionine sulfoximine and then amplified, as described (Davis, SJ, Davies, EA, 
Barclay, AN, Daenke, S, Bodian, DL, Jones, EY, Stuart, DI, Butters, TD, Dwek, RA, 
and van der Merwe, PA (1995) J Biol Chem 270, 369-375). Methionine sulfoximine- 
30 amplified cells are grown first in GMEM-10 (Gibco-BRL) containing 10% dialyzed 
fetal bovine serum (FBS) (Gibco-BRL) and 1.5 mM NB-DNJ for 3 days and then 
refed with GMEM-10, 5% dialyzed FBS, 2 mM NB-DNJ. This medium is changed 
twice over a period of 9 days before harvesting. 
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Enzyme Purification 

Soluble, recombinant tACE (wild-type, ACEA36NJ and ACE glycosylation mutants), 
is purified from conditioned media by affinity chromatography on a Sepharose-28- 
lisinopril affinity resin. The protein is quantitated by amino acid analysis and assayed 
5 for activity using the substrate hippuryl-L-Mstidyl-L-leucine , as described (Ehlers, 
MRE, Chen, Y -N, Riordan, JF (1991) Proc Natl Acad Sci USA 88, 1009-1013). 

Deglycosylation of ACE 

tACEA36NJ (12.5 nmol) purified from cultures treated with NB-DNJ is digested with 
10 endo H (30 mU) in 100 mM sodium phosphate, 0.1 mM ZnCl 2 , 1% BSA, pH 6.0 for 
16 h at 37°C. The endo H-treated ACE is passed through a lectin affinity column 
consisting of equal parts of concanavalin A, wheat germ, and lentil lectin, after 
equilibration with 20 mM Tris-HCl, 0.5 M NaCl at pH 7.4. The deglycosylated ACE 
is collected in the flowthrough. Free oligosaccharides and any other impurities are 
15 removed from the flowthrough fraction by a final lisinopril-Sepharose affinity 
chromatography step. The homogeneity of the tACEA36NJ after deglycosylation is 
confirmed by SDS-PAGE on a 4-20% acrylamide gel and MALDI-TOF mass 
spectrometry. 

20 Construction of Glycosylation Mutants 

Minimally glycosylated isoforms of human testis ACE are constructed by the removal 
of a combination of iV-linked glycosylation sites. Glycosylation is abolished by 
site-directed mutagenesis of the site of attachment in the recognition sequon 
(Asn-X-Ser/Thr) through a conserved transition of the Asn to a Gin residue. A 

25 truncated form of tACE (tACEA36 lacking the first N-terminal 36 residues is used for 
the construction of all mutants. tACEA36 cDNA is divided into four fragments and 
introduced into pGEM-HZf(+) to facilitate site-directed mutagenesis. An Eco47JH 
site is introduced into pGEM-llZf(+) for cloning of the first and second tACEA36 
fragments. A 146 bp BamBl/Nhel digested pBR329 fragment that contains an 

30 Eco47I& site is ligated to BamHUXbal digested pGEM. The following mutagenic 
oligonucleotides are used for altering the consensus sites for N-linked glycosylation: 
gl-F, g2-F, g3-F, g4-F, g5-F, g6-F, gl-R, g2-R and g3-R, with the number referring to 
each complementary recognition sequon. The mutagenic oligonucleotides encode the 
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Asn to Gin transition (underlined) at the glycosylation sites and a silent mutation that 
alters a restriction enzyme recognition site (highlighted in italics). Mutated residues 
are indicated by bold type: 

5 Forward primers: 

gl-F (40-mer): 5 , -GAGGCC^477X7GAACTACAACACCCAGATCACCACAGAG-3 , 
g2-F(36-mer): 5 , -ATGCAAATAGCCCAGCACACCC7T^4GTACGGCACC-3 , 
g3-F (40-mer): S'-GAAGTrrGATGTTAACCAGTTGCAGCAGACCACTATCAAG^ 
1° g4-F (30-mer): 5 , -GTGTGCCACCCGCAAGG7MGC^GCCTGCAG-3 , 

g5-F (36-mer): 5 , -CCGTGCCTCCTG^47TCTGGCAGAAGTCGATGCTGG-3 , 
g6-F (36-mer): 5 , -ACGGGCCAGCCCCAGATa4GCGCJTCGGCC-3 , 

Reverse primers are complementary to the forward primers: 

15 

gl-R (40-mer): S-CTCTGTGGTGATCTGGGTGTTGTAGTTCCAATTGGCCTCG-S 1 
g2-R (36-mer): 5 , -GGTGCCGTACTTAAGGGTGTGCTGGGCTATTTGCAT-3 , 
g3-R (40-mer): 5 f -CTTGATAGTGGTCTGCTGCAACTGGTTAACATCAAACTTC-3 , 

20 In the first fragment a cDNA-containing mutation at one site served as a template for 
the mutagenesis of the second site thus generating five variants with either one of two 
sites out of the first three eliminated. The fourth, fifth and sixth sites (N155, N337 and 
N586) are eliminated from fragments 2, 3 and 4 respectively using primers g4-F, 
g5-F, g6-F. The nucleotide sequence of each fragment is confirmed by sequencing to 

25 ensure that only the desired mutation has been created. The four fragments are 
reassembled to produce eight tACEA36 glycosylation mutants (Fig 1). 

The mutants are introduced into the mammalian expression vector, pLEN, to facilitate 
the production of underglycosylated tACEA36 protein. The expression vectors pLEN- 

30 tACEA36g(n), where n is a set of numbers defining the available A^-linked 
glycosylation sites, are stably expressed in CHO cells as described previously (Ehlers 
et aL (1996) Biochemistry 55, 9549-9559). Similarly, wild-type tACE (tACE-wt), 
retaining its 36-residue N-terminus as well as the transmembrane region and 
juxtamembrane stalk, is stably expressed in CHO cells. A deletion mutant, 

35 tACEA36NJ, truncated after Ser625, and thus lacking the cytoplasmic and TM 
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domains as well as most of the juxtamembrane stalk, is constructed (Yu et al. (1997) 
J. Biol. Chem. 272, 3511-3519). The vector pEE14-ACEA36NJ is transfected into 
CHO cells and clones stably expressing the mutant are amplified using methionine 
sulfoxamine, as described (Yu etal. (1997) J. Biol. Chem. 272, 3511-3519). 

5 

Inhibition oftACE glycosylation in presence oftunicamycin 

CHO cells expressing tACE-wt are grown to confluence in 6-well plates and induced 
overnight in 2 % medium in the absence and presence of 5 ug/ml tunicamycin. Fresh 
medium is added and cells are grown in the absence and presence of 5 ug/ml 
10 tunicamycin. At 6, 8, 12, 16, 20 and 24 hours detergent-solubilised cell samples are 
collected. After separating the proteins on 10 % SDS-PAGE they are immunodetected 
by Western blotting. 

Western Blot Analysis of Glycosylation Mutants 

15 tACE-wt, tACEA36, tACEA36-gl, -g2, -g3, -gl2, -gl3, -g23, -gl23 and -gl234 are 
immunodetected by Western blotting of cell lysates and harvested medium from 
transfected CHO cells. Proteins are separated on 10 % SDS-PAGE and transferred to 
nitrocellulose membrane (Hybond-C, Amersham). The membrane is probed with a 
polyclonal rabbit anti-human tACE antibody. The membrane is developed using the 

20 ECL chemilurninescence kit (Amersham) and visualised on autoradiographic film, as 
per instructions. 

Analysis of Release Kinetics and Cleavage Sites 

After selection for stable transfectants, kinetic analyses of rates of accumulation of 
25 soluble (released) ACE activity and changes in membrane-bound ACE activity are 
performed in the presence and absence of 1 uM phorbol 12,13 dibutyrate (Schwager 
et al. (1998) Biochemistry 37, 15449-15456). Identification of the stalk cleavage sites 
and analysis of the C-terminal glycosylation site in the released (soluble) protein is 
carried out using limited proteolysis and MALDI-TOF mass spectrometry by methods 
30 described previously (Schwager et al. (1998) Biochemistry 37, 15449-15456). 

Determination of kinetic constants for tACE hydrolysis of Hippuryl-L-histidyl-L- 
leucine (Hip-His-Leu) 
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The enzyme is purified from harvest medium via lisinopril affinity chromatography 
and rates of substrate hydrolysis are determined. Assays are performed in 100 mM 
potassium phosphate pH 8.3 containing 300 mM NaCl. Initial velocities are 
calculated over a range of Hip-His-Leu concentrations (0.2-5.0 mM) under initial rate 
5 conditions and fit to the Michaelis-Menten equation. K m and V max values are 
determined by non-linear regression analysis. Turnover numbers (^ at ) and specificity- 
constants (host/Km) are determined using a calculated molecular mass of 100 kDa. 

Mass Spectrometry 

10 Mass spectrometry is used to verify the glycan occupancy of some of the 
glycosylation sites of the mutant proteins. All mass spectra are obtained on a 
MALDI/TOF/MS instrument (Voyager-Elite Biospectrometry Workstation, 
PerSeptive Biosystems, Inc.). A nitrogen laser (337 nm) is used for desorption 
ionization. Measurements are carried out either in the linear or reflectron mode with 

15 mass accuracies of 0.1% and 0.01%, respectively. Spectra are collected over a 
hundred laser shots. 

Typical matrices used in these experiments are 3,5-d^emoxy-4-hydroxycinnamic 
acid (sinapinic acid) and a-cyano-4-hydroxycinnamic acid (Aldrich). About 1 pi of 
20 sample solution was mixed with 2 pi of the matrix solution (10 mg/ml in 50% v/v 
CH 3 CN and H 2 0). A 0.5 pi volume (containing 1-10 pmol of peptide or peptide 
mixture) of the above solution is loaded on the sample plate and allowed to dry. 

Crystallisation 

25 The purified tACE- ACEA36NJ and ACE glycosylation mutants are stored at -20 °C 
in 10 mM HEPES and 0.1% PMSF. Extensive crystallisation trials using 
commercially available crystal screen conditions (Hampton Research) are tried. In 
addition ammonium sulphate, PEG and MPD matrices are also tried. The best crystals 
for these proteins are grown at 1 6 °C by the vapour diffusion hanging drop method by 

30 mixing 2 pi of the protein solution at ~1 1.5 mg/ml in 10 mM HEPES and 0. 1% PMSF 
with an equal volume of a reservoir solution containing 15 % PEG 4000 (Fluka), 50 
mM CH 3 COONa.3H 2 0 (Sigma Chemical Company) pH 4.7 and 10 pM ZnS0 4 .7H 2 O 
(Aldrich Chemical Company). Crystals usually appear within 2 weeks and grow to 
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their maximum size after about a month. The crystals belong to P2i2i2i space group, 
with 1 molecule in the crystallographic asymmetric unit and some 49 % of the crystal 
volume occupied by the solvent (cell dimensions: a= 56.47 A, b=84.90 A, c=133.99 
A, <x=90°, 0=90° and y=90°). tACE-ACEA36NJ crystals diffract to 2 A while tACE- 
5 ACEA36NJ-gl3 and tACE- ACEA36NJ-gl234 crystals diffract to 3.0 A and 2.8 A 
respectively on a Synchrotron Radiation Source. All crystals are isomorphous. 

The tACE-lisinopril (lisinopril.dihydrate) inhibitor complex is obtained by growing 
the crystals in the presence of inhibitor. In these experiments the protein solution is 
10 mixed with 10 mM of the inhibitor and mixed with an equal volume of the reservoir 
solution before setting up the crystallisation. Single crystals belonging to the P2i2i2i 
orthorhombic space group (isomorphous with the native crystals) suitable for 
diffraction work appear after about 4 weeks. 

15 X-ray diffraction data collection 

Before data collection, all crystals are flash-cooled at 100 K in a cryoprotectant 
containing 15 % PEG 4000, 50 mM CH 3 COONa.3H 2 0 pH 4.7, 10 jjM ZnS0 4 .7H 2 0 
and 25% glycerol with and without respective entities - such as ACE inhibitors. All 
the X-ray data are collected at 100 °K using a Synchrotron Radiation Source (ESRF- 

20 Grenoble, France). Multi-wavelength anomalous dispersion (MAD) data are 
collected with crystals of tACE-lisinopril complex at peak of Zn K-edge (1.2825 A), 
inflection (1.28322 A) and remote (0.95373 A) wavelengths. An anomalous dataset at 
long wavelength (1.7712 A), the closest possible wavelength to Sulphur K-edge is 
also collected. All other heavy atom data sets are collected at the SRS-Daresbury 

25 (UK) and processed with anomalous signal. Out of more than 32 soaks with different 
heavy atoms, only three are found to be useful in phasing. These three (Pt, Pd & Os) 
derivatives are prepared by soaking the tACE-lisinopril complex crystals for - 10 to 
60 minutes in the presence of 1-5 mM heavy atom solutions. Raw data images are 
indexed and scaled using the DENZO and SCALEPACK modules of the HKL suite 

30 [Otwinowski M and Minor W. (1997) Methods Enzymol 276, 307-326]. 

Example 2 
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Expression of glycosylation mutants in CHO cells and the kinetics of release 

The role of JV-linked glycosylation in the expression and processing of tACEA36 is 
investigated to establish the minimal glycosylation requirements for the expression of 
5 correctly folded enzymatically-active protein. The eight glycosylation mutants 
constructed are transfected into CHO cells and the effect of deglycosylation on the 
activity, expression and processing of tACEA36 is assessed by enzyme assays, 
immunodetection and cleavage site analysis. 

10 Western blot analysis of cellular and soluble tACEA36 glycoforms reveal proteins 
with increased mobility relative to glycosylated wild type tACE (tACE-wt) (Fig 2) 
and which is dependent upon on their degree of glycosylation. Furthermore, 
differences are detected in the expression and processing of the various glycoforms. 
Comparatively low levels of tACEA36-g2 protein in cellular extracts (Fig. 2A), and 

15 its absence in the harvest medium (Fig. 2B), may suggest intracellular degradation 
and/or deficient processing of the protein. Whilst tACEA36-gl demonstrates 
comparable levels of expression to the other proteins (Fig. 2A), processing and 
solubilisation appears to be less efficient than tACEA36-g3, -gl23, and -gl234 (Fig. 
2B). Tunicamycin partially inhibits the formation of mature, glycosylated tACE-wt. 

20 Western blot analysis of cellular tACE-wt in the absence of tunicamycin reveals a 
105 kDa mature, glycosylated protein (Fig 3). In the presence of tunicamycin an 
additional smaller, deglycosylated 85 kDa protein is detected (Fig 3). 

The K m and Abat values obtained for the hydrolysis of Hip-His-Leu by tACE (Table 3) 
25 are in agreement with those previously published for the C-fragment of the human 
endothelial isoform (K m = 2.0 mM) (10). Differences in the kinetic constants of 
glycosylated and under-glycosylated tACEA36 mutants are negligible and are not 
considered sufficiently different to reflect major alterations in the conformation and 
activity of the protein. Glycosylation of tACEA36 at one (tACEA36-g3) or two 
30 (tACEA36-gl3) N-terminal glycosylation sequons is sufficient in maintaining the 
functional integrity of the enzyme. Furthermore, treatment of cells expressing 
tACE A3 6N J with the glucosidase I inhibitor NB-DNJ, do not alter the kinetic 
properties of the expressed enzymes. 
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Example 3 

Kinetics of release and analysis of juxtamembrane cleavage sites 

5 

Independent transfections of CHO cells with tACEA36-gl, -gl2, -gl23 and -gl234 
are performed which yield consistent results for each mutant (Fig. 4). All 
glycosylation mutants show similar levels of phorbol ester stimulation. Thus, N- 
linked glycosylation does not appear to affect solubilisation of the membrane- 

10 anchored enzyme. Further protein analysis is performed to: 1) identify the C-terminal 
peptide to determine whether cleavage of tACEA36-gl, -gl3 and -gl234 occurrs at 
the same residues as tACE and 2) to investigate the glycosylation status of the seventh 
recognition sequon which lies seven residues proximal to the tACE cleavage site. The 
mass spectra of endoproteinase Lys-C digested peptides of tACEA36-gl, -gl3 and 

15 -gl234 reveal three [M+H] + ions of m/z 1698.7, 1690.5 and 1690.9, respectively 
(Table 2), which are in close agreement with the theoretical mass of the C-terminal 
peptide (calculated m/z 1690.8). 

Thus, tACEA36-gl, -gl3 and -gl234 proteolysis occurs between Arg627 and Ser628, 
20 which is at the same site as tACE-wt. The spectra of the mutant tACE A3 6N J reveals a 
[M+H] + ion at m/z 1463.1 which corresponds to the calculated mass of the peptide 
Leu614 to Ser625 confirming that truncation occurrs at Ser625 and that there is no 
further limited proteolysis of the C-terminus (Table 2). 

25 Example 4 

Structure of ACE 

The crystal structure of the native ACE enzyme is determined to 2.0 A resolution. 
30 The structure from residues 71-648 adopts an overall ellipsoid shape (dimensions 
~72A x 57A x 48A) with a central groove that extends for about 30A into the 
molecule and effectively divides the protein into two "domains" (labelled I and II, 
Fig. 7a,b). On top of the molecule there is a N-terminal "flap" that allows access to 
the active site cleft Like the recently determined zinc-peptidase structure-neurolysin, 
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ACE is also bristling with helices and is comprised of twenty a-helices (with almost 
equal distribution of helices in both domains) and seven 3io-helices (Fig. 7c). This 
makes the molecular architecture very compact and rigid as observed from the 
thermal parameters (crystallographic B-factors) for the structure. The only P-structure 
5 which accounts for 4% of all residues occurs as six relatively short strands, two of 
which are located near the active site (Fig. 7a). The boundaries of the groove are 
provided by domain II (ocl3, ocl4, ocl5, ocl7, P4) with part of the domain I forming 
the flap (al, a2, a3). In addition the base of the scaffold is formed by helices a6 and 
a8 and helix a 15 serves as an anchor. 

10 

Eight C-terminal residues are disordered in the crystal structure and most of this 
flexible region constitutes the unconstrained juxtamembrane region distal to the wild- 
type ACE cleavage site. The a-helix from Ala 620 to His 641(o20) defines the C- 
terminal boundary of the ectodomain and this is in agreement with previous ACE 

15 mutagenesis and cleavage-secretion studies (Ehlers et al, 1996; Chubb et al, 2002). 
Residues Ser 466-Gly 469 are disordered in the structure. Five hundred and four 
ordered water molecules were identified in the native structure. All six glycosylation 
sites (gl-6, Fig. 7c) are exposed and gl site is closest to the N-tenninus of the 
molecule. Sketchy density was observed for N-linked carbohydrates in all the 

20 glycosylation sites (Fig 7c) and modelled as N-acetylglucosamine (NAG) in the 
native structure. The co-ordinates of underglycosylated tACEA36NJ ACE are shown 
in Table A. 

Example 5 

25 Zinc co-ordination 

ACE belongs to the gluzincin family of metalloproteases (Hooper, 1994). Zinc is 
known to be an important catalytic component of ACE. In the structure, the active site 
helices <xl3 and <xl4 define the substrate binding cleft of this peptidase. One highly 
ordered zinc ion (B-factor=16.34A 2 ) is bound at the active site. The helix otl3 
30 contains the HEXXH zinc binding motif (known to be important for the catalytic 
activity of ACE, Wei et al., 1991), with its two zinc coordinating histidines (His 414 
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and His 418) and Glu 442 (from al4 helix) acts as the third ligand (Fig 7c). 
Additional coordination is provided by an acetate ion (from the crystallization 
medium) bound at the active site of the native enzyme (Fig. 8a). It has been reported 
previously that the recruitment of zinc ion in ACE catalysis is analogous to that in 
5 thermolysin (Williams et al, 1994; Corvol et al, 1995) and our structural data shows 
that the zinc binding sites in both proteins are indeed very similar (r.m.s.deviation 
1.52A) except the acetate ion which is replaced by a water molecule in the 
coordination sphere in thermolysin. The active site pocket in ACE is occupied by 3 
ordered water molecules. In addition, a stretch of clear electron density (at 3a level) 
10 was observed (~5A away from the zinc ion) at the active site which is interpreted as 
part of butyl succinate (Fig 8a). 

Example 6 

Activation of ACE by chloride ions 

15 

Substrate hydrolysis by ACE is activated by chloride anions and, whereas some 
substrates can be cleaved in the absence of chloride, it is an absolute requirement for 
angiotensin I (Bunning and Riordan, 1983; Shapiro et al, 1983). Our structure 
revealed the location of two chloride ions (Fig. 7a, c). The first chloride ion (Cll, 

20 20.7A away from the zinc ion) is bound to three ligands, Arg 520 (NH1), Arg 217 
(NE), Trp 516 (NE1) and water and is surrounded by a hydrophobic shell of four 
tryptophans. The second anion (C12, 10.4A away from the zinc ion) is bound to Arg 
553 (NE) in agreement with previous report showing Arg 1098 (the analogous Arg 
residue in the C-domain of somatic ACE) is critical for chloride dependence of ACE 

25 activity (Liu et al, 2001). Tyr 255 and a water molecule are the other two C12 ligands. 
The two chloride ions (separated by 20.3 A) in ACE are bound almost in a triangular 
conformation -in accordance with the near equatorial coordination and 
accommodation of planar trigonal anions in a-amylase where the chloride ions are 
involved in the allosteric activation (Aghajari et al, 2002). In addition, the anchoring 

30 of Domain II on Domain I scaffold appears to be mediated by the chloride ions. 



Mutating Arg 1098 to a lysine resulted in a 100-fold reduction in chloride binding 
affinity as well as a decrease in substrate affinity (Liu et al, 2001). This might be due 
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to the contact with the NE atom of arginine as opposed to an NH1 or NH2 atom with 
lysine which would permit free rotation of the basic terminal nitrogens. Recent studies 
suggest that the affinity of chloride binding to ACE is directly related to the 
interactions between the chloride and enzyme as well as the substrate (Liu et al, 

5 2001). However, based on the location of both chloride ions in the 3D structure make 
the latter interaction unlikely. Nevertheless, there is a van der Waals contact between 
C12 and Pro 438 (ocl4) that could be responsible for maintaining the active site cleft in 
a conformation which favours substrate accessibility and binding. Indeed al4 contain 
the third zinc co-ordinating ligand Glu 442. Thus, the chloride interactions likely 

10 cause the enzyme to adopt conformations that permit higher affinity substrate binding 
and that it is not important in substrate binding per se. 

Example 7 

15 Structure determination and refinement of an ACE-lisinopril complex 

The crystal structure of tACE bound to the potent lisinopril inhibitor (Ki=27 x 10 10 M, 
Wei et aL, 1991) complex is determined by a combination of MAD and MIRAS 
(Multiple Isomorphous Replacement with Anomalous Scattering) procedures. The 

20 position of zinc atom is unambiguously identified using the anomalous difference 
Patterson maps calculated using diffractions data at peak wavelength. The MAD 
phases thus obtained are not very strong and thus additional phase information is 
obtained using MIRAS procedures with three- Platinum (K^PtCLO, Palladium 
(KaPdCU) and Osmium (OsCl 3 ) heavy atom derivatives. The identified Zn site is 

25 used to obtain the starting phases in each derivative. Double difference Fourier maps 
calculated using FFT routine in CCP4 program [Collaborative computational project 
Number 4. The CCP4 Suite: Programs for Protein Crystallography (1994) Acta 
Crystallogr. D 50, 760-763] gave the first major binding site and the phases from the 
combined Zn and first major site are used to get additional major/minor sites for each 

30 derivative. All heavy atom binding sites and the Zn site are refined to 2.8 A 
resolution using the program MLPHARE [Collaborative computational project 
Number 4. The CCP4 Suite: Programs for Protein Crystallography (1994) Acta 
Crystallogr. D 50, 760-763] and SHARP [De La Fortelle, R & Bricogne, G. 
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Maximum-likelihood heavy-atom parameters refinement in the MIR and MAD 
methods (1997) Methods Enzymol. 276, 472-494]. The overall figure of merit from 
SHARP is 0.38 (at 2.8 A resolution) and is improved to 0.89 (at 2.0 A) by iterative 
solvent flattening, phase combination and phase extension with the program 
5 SOLOMON [Abrahams, J. P. & Leslie, A. G. W. Methods used in structure 
determination of bovine mitochondrial Fl ATPase. (1996) Acta CrystaUogr. D 52, 
110-119]. Model building is carried out manually using the program O [Jones, T. A., 
Zou, J. Y., Cowan, S. W. & Kjeldgaard, M. Improved methods for building protein 
models in electron density maps and the location of errors in these models (1991) 

10 Acta CrystaUogr. A 47, 110-1 19]. Refinement of the model is carried out using the 
program CNS [Brunger, A. T. et al. Crystallography & NMR System: A new software 
suite for macromolecular structure determination. (1998) Acta CrystaUogr. D 54, 905- 
921]. During the final stages of refinement water molecules, zinc ion and the 
inhibitor molecule are inserted in the respective structure. Both the native and 

15 inhibitor complex structures are refined at 2.0A resolution with -94 % residues in the 
maximum allowed region and no residue in the disallowed region of the 
Ramachandran map. The final structure of the tACE and ACE- lisinopril complex has 
an Rftee [Brunger, A. T. Free R value: a novel statistical quantity for assessing the 
accuracy of crystal structure. (1992) Nature 355, 472-475] of 22.08 and 21.88 %, and 

20 a final of 18.29 and 18.14 % respectively. 

Highly ordered binding of the inhibitor (overall B-factor 15.26A 2 ) in an extended 
conformation with the aromatic phenyl group stretching towards the flap (Fig 7a) and 
its lysine side chain parallel to the HEXXH motif (Table 1). Upon complex formation 

25 no significant rearrangement of active site residues was observed. The Pi carboxylate 
of lisinopril which is substituted for the normal substrate scissile amide carbonyl is 
well positioned to bind to the active site zinc (located between subsites Si and Si*). 
Thus lisinopril provides a coordinating atom (provided by an acetate ion in the native 
structure, Fig 8a), while the other three (two histidines and a glutamic acid) being the 

30 same as in the native structure (Fig. 8b). The phenyl ethyl group binds to the Si 
subsite and the lysyl amine to Glu 193 via van der Waals interactions at the Si' subsite 
of ACE. Surprisingly, the C-terminal carboxylate which is thought to interact with a 
positively charged arginine residue instead binds to a lysine (Lys 542) as well as Tyr 
551. The high affinity is attributable to the presence of the S-configuration aromatic, 
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aralkyl or aliphatic groups able to bind to the Si site (Brenner et al., 1990). The 
contact residues for the lisinopril complex are shown in Table L The co-ordinates of 
the underglycosylated tACEA36NJ ACE-lisinopril complex are shown in Table B. 

5 Example 8 

The homology of ACE with other metallopeptidases 

ACE belongs to the M2 family of zinc binding metallopeptidases which falls under 
10 the umbrella of the MA clan. Outside the HEXXH metal coordinating signature 
sequence of this family there is little sequence similarity between ACE and other 
members of the family. Surprisingly, peptidases such as thermolysin (MA clan, M4 
family) and carboxypeptidase A (MC clan, M14 family) which have been used in 
comparative molecular field analysis and 3D quantitative structure-activity 
15 relationship studies (Waller & Marshall, 1993) showed no structural homology with 
ACE. 

Surprisingly, structural comparison of ACE with other protein structures using the 
DALI server (Holm and Sander, 1999) showed significant structural homology with 

20 that of neurolysin (Brown et al, 2001), a protein involved in neurotensin metabolism 
(Fig. 3a, b). Neurolysin is a member of the M3 family of thimet oligopeptidases and 
like ACE, belongs to the family of metallopeptidases bearing the HEXXH active site 
motif (Rawlings and Barrett, 1995; Brown et al, 2001). It also comprises of an 
abundance of a-helices with the p-structure accounting for only 6% of all residues. 

25 The two proteins do not exhibit any amino acid sequence similarity (close to random 
score), yet when the two structures are optimally superimposed (using DALI server), 
there is noticeable match with an r.m.s. deviation of 4.0A for 143 C a atoms. It appears 
that the core structure for the two proteins are highly similar with significant 
differences in loops on the outer surface in the case of neurolysin (Fig. 3a,b). The 

30 striking similarity also extends to the active site region in neurolysin consisting of a 
deep narrow channel that divides the molecule into two halves. It has been speculated 
that using the flexible secondary structure elements in the active site cavity, the 
neuropeptidase can effectively cleave a variety of small peptides. Likewise in ACE, 
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the geometry of the active site groove clearly accounts for ACE's inability to 
hydrolyse large, folded substrates. Furthermore, the enzyme's preference for 
oligopeptide substrates of about thirteen residues or less suggests that the substrate 
does not have the same freedom to extend outside of the channel during catalysis. 

5 

Example 9 

Substrate specificity 

10 The geometry of the active site channel clearly accounts for ACE's inability to 
hydrolyse large, folded substrates. Furthermore, the enzyme's preference for 
oligopeptide substrates of about thirteen residues or less suggests that the substrate 
does not have the same freedom to extend outside of the channel during catalysis, 

15 Example 10 

Crystallization of ACE mutants 

tACEA36-gl234 and tACE A3 6-g 1,3 mutants are successfully crystallised under 
20 similar conditions as tACEA36NJ protein (Fig 5). Preliminary diffraction experiments 
show that tACEA36-gl234 crystals diffract to at least 2.8 A and tACEA36-gl,3 
crystals diffract to 3.0 A respectively and these crystals are isomorphous with 
1ACEA36 crystals. 

25 CONCLUSIONS 

The minimal glycosylation requirements for the expression and processing of 
enzymatically-active human tACE are determined by progressive deglycosylation. To 
investigate which sugars are vital for tACE expression, a series of mutants are 
30 constructed that contain a limited number of sites for Af-linked glycosylation. N- 
glycan recognition sequons within the protein sequence are disrupted by mutagenesis 
of the Asn to a Gin within the recognition sequon. 
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The removal of a large percentage of glycans does not hamper the expression of 
enzymatically-active tACEA36 and the presence of glycans at two of the three N- 
terminal sites is sufficient to produce protein that retain the kinetic properties of the 
native enzyme. Furthermore, the minimum JV-linked glycosylation that allowed for 
5 enzymatically-active tACE to be expressed is the presence of glycans at the first or 
third site. Glycosylation at either of these sites appears to be sufficient for protein 
folding, whereas glycosylation of the second site only, results in rapid intracellular 
degradation. This is in agreement with the glycosylation requirements of active rabbit 
ACE expressed in mammalian cells and yeast which also has a minimal requirement 
10 of at least one of the three N-terminal sites (3a). 

In the construction of the tACEA36 glycosylation mutants, the seventh site which is 
not present in rabbit tACE is not targeted as it has been shown to be unglycosylated 
(7a). With the removal of the glycosylated sequons, it is possible that this site may be 
15 glycosylated in order to compensate for the loss of oligosaccharide chains. However, 
this is not the case and that even when there is only one JV-linked oligosccharide, this 
does not occur. 

There is a preference for oligosaccharides to be attached to sequons containing 
20 threonine residues rather than serine residues. This is observed in human tACE, 
where all sequons with a serine in the third position are partially glycosylated and 
those with threonine are fully glycosylated, except for the seventh site, which has a 
threonine in the third position (7a). 

25 The fact that glycosylation at the seventh site appears to be occluded by the presence 
of the tryptophan, rather than the structure of the juxtamembrane region, is supported 
by the findings that a tACE deletion mutant, tACE-A6, which generates a novel Asn- 
Arg-Ser sequon adjacent to the wild-type cleavage site, is glycosylated (14a). The 
extended conformation of oligosaccharides may prevent access of the secretase at this 

30 point resulting in displacement of the wild-type cleavage site 13 residues towards the 
C-terminus (14a). 
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CHO cells expressing wild-type tACE (tACE-wt) are exposed to tunicamycin, which 
inhibits N-linked glycosylation, to determine the effect of complete deglycosylation 
on the expression and activity of tACE over time. Cellular tACE-wt activity in cells 
treated with tunicamycin is approximately three-fold lower than that of untreated cells 
5 (data not shown) and a smaller immature form of the protein is observed by Western 
blot analysis. These results are consistent with the interpretation that the smaller 85 
kDa-protein is the deglycosylated form and the 105 kDa-protein, the mature 
glycosylated form (7a), (9a), (14a). Thus, tunicamycin inhibits the formation of 
mature glycosylated tACE-wt, to a certain degree, resulting in the accumulation of 
10 unglycosylated tACE-wt in the cell, without a significant decrease in the total amount 
of protein compared to untreated cells. Our data suggest that unglycosylated tACE-wt 
is not solubilised and that it requires the presence of N-glycans for proper expression, 
maturation and processing. 

15 The enzymatically-active tACE-wt present in cells treated with tunicamycin is most 
likely the product of a pool of glycosylated tACE-wt, which is either produced prior 
to tunicamycin treatment or is resistant to tunicamycin. Tunicamycin may therefore 
not be completely effective in CHO cells, whereas in HeLa cells, Kasturi et al. found 
that tunicamycin-treatment resulted in the generation of deglycosylated protein that is 

20 degraded intracellularly within three-hours, with no soluble tACE being detected (5a). 
Thus, it appears that the effect of tunicamycin on tACE glycosylation appears to be 
cell-specific. 

Enzymatic removal of iV-linked glycans is used to produce crystals of CD2 
25 glycoprotein that diffracted to 2.5 A (15a). However, crystallisation trials with 
deglycosylated 1ACEA36NJ that has a single GlcNAc residue attached to the 
asparagine residue of the ^/-linked site only produced needle-like crystals that are not 
suitable for further diffraction studies. Surprisingly, tACEA36NJ expressed in the 
presence of NB-DNJ, but retaining the simple high mannose oligosaccharides 
30 (Glc 3 Man 7 GlcNAc2) yielded the best crystals for X-ray diffraction studies and 
subsequent successful 3D structure determination at 2.0A resolution. 
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Interestingly, the presence of the membrane anchor increased the extent of human 
CD59 glycan processing (16a). Possible reasons for this difference are, that the 
ectodomain is held closer to the glycan-processing enzymes and/or the longer 
exposure to the glycosyltransferases. The oligosaccharide processing of soluble 
5 tACEA36NJ protein may also differ from the membrane-bound form however, 
expressing the protein in the presence of a glucosidase inhibitor would prevent the 
production of complex oligosaccharides. 

Furthermore, the kinetic properties of this mutant are not influenced by NB-DNJ 
10 treatment. Similarly, Yu et aL reported identical K m values for glycosylated and 
deglycosylated forms of tACEA36NJ, in agreement with that of tACE-wt, using 
furanacryloyl-Phe-Gly-Gly as a substrate (7a). Glucosidase inhibitors, at 
concentrations required to block trimming of the terminal glucose residues and 
subsequent complex oligosaccharide formation, can affect cell maturation and 
15 apoptosis (Misago M, Tsukada J, Fukuda MN, Eto S (2000) Biochem Biophys Res 
Commun. 269, 219-225). This treatment also results in a decrease in protein 
expression (unpublished data). Thus, mutants lacking some of the C-terminal N- 
linked glycans are used with the objective of producing protein with a minimal 
number of oligosaccharides that would crystallise in a reproducible fashion. 

20 

Crystals of glycosylation mutants gl3 and gl234 have been grown under similar 
condition as in the case of 1ACEA36NJ. The glycosylation mutant crystals seem to 
grow faster than tACEA36NJ crystals. However, they are smaller in size in the first 
instance, but have been proven to be suitable for diffraction work. 

25 

The determination of the 3D structure of ACE and ACE-lisinopril complex may be 
effectively used for the development of novel, highly selective ACE modulators 
targeted, for example, to either the N or the C domain, by structure-based rational 
drug design. This may produce a new generation of ACE inhibitors with the potential 
30 for greater efficacy, fewer side effects and treatment of new indications (e.g. 
polycythemia). In addition, the unanticipated similarity with neurolysin has shown 
the structural conservation amongst an emerging family of peptidases with a common 
evolutionary origin. 
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In a further aspect, the present invention relates to a composition comprising ACE in 
a crystalline form. 

5 In a further aspect, the present invention relates to a scalable 3D model of ACE 
having at least a portion of the structure co-ordinates shown in Table A or Table B. 

All publications mentioned in the above specification are herein incorporated by 
reference. Various modifications and variations of the described methods and system 

10 of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood that the 
invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention 

15 which are obvious to those skilled in molecular biology or related fields are intended 
to be within the scope of the following claims. 
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Table 1 



ACE-Lisinopril Contact Residues (hydrogen bonds at the active site) 



Ligand atoms 


Interacting atoms 


Distance (A) 


Lisinopril 






atom 






Ol 


His 384 NE2 


2.76 


Nl 


His 384 NE2 


3.24 


Nl 


Ala 385 O 


2.92 


04 


Lys 542 NZ 


2.93 


04 


Tyr 551 OH 


2.56 


03 


Tyr 554 OH 


2.77 


02 


Wat 


2.82 


05 


Wat 


2.68 


N3 


Wat 


3.01 


N3 


Wat 


3.26 


05 


Wat 


2.77 


N3 


Wat 


3.11 


02 


Glu 415 OE2 


2.70 


Ol 


His 544 NE2 


3.11 



Zinc Coordination-ACE-Lisinopril Complex 



Coordination Distance (A) 

Zn His 414 NE2 2.04 

Zn His418NE2 2.07 

Zn Glu 442 OE1 2.00 

Zn Lis 03 2.14 



Lis = Lisinopril 
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Table 2 



Data Collection 



Crystal 


Wavelength(A) 


Resolution range a 
(A) 


% 

completeness 0 


<t>/< a> 8 


Rmerge 

(%) 


MAD Phasing 


ZnPeak 1.2825 


50-2.00(2.07-2.00) 


94.7 (63.2) 


11.54(1.93) 


5.4(22.3) 


z,n inn. l .zoiz 


50-2.01(2.08-2.01) 


95.9(72.1) 


11.55(1.98) 


5.8(23.8) 


Znrem. 0.9537 


50-1.98(2.05-1.98) 


94.1(65.8) 


15.48(2.63) 


4.8(14.9) 


Long Wavelength 
1.7712 


50-2.66(2.76-2.66) 


99.6(97.0) 


22.92(4.94) 


8.3(21.8) 


MIRAS 
Phasing data 












Pt 


0.87 


20-2.80(2.9-2.8) 


90.2(83.6) 


9.94(4.52) 


11.2(27.1) 


Pt25 


0.87 


20-2.60(2.69-2.6) 


89.9(75.9) 


13.6(4.17) 


8.8(27.8) 


Pd 


0.978 


50-2.18(2.26-2.18) 


89.4(81.2) 


26.14(6.73) 


5.8(25.8) 


Pd2 


0.978 


30-2.60(2.69-2.6) 


95.2(86.7) 


16.95(5.46) 


9.5(35.7) 


Oscl 


1.488 


50-2.80(2.9-2.8) 


96.6(91.4) 


18.51(6.16) 


13.1(40.4) 


Native Data 


1.488 


50-2.00(2.07-2.0) 


98.8(95.7) 


22.46(4.44) 


8.1(43.2) 



^Refinement 

5 



Model 


Resolution 
(A) 


Reflections 
woridng/test 


(%) 


%) 


Rms deviations 


BfectorRmsd(A z ) 












Bond 

length 

(A) 


Bond 
angle 
(°) 


Bonded 
Main Chain 
Atoms 


Bonded 
Side 
Chain 
Atoms 


Native 
(ACE) 


47.14-2.0 


39727 / 1675 


18.29 


22.08 


0.005 


1.22 


1.281 


2.126 


Lisinopril 

complex(A 

CE-LIS) 


47.14-2.0 


76164/3094 


18.14 


21.88 


0.0055 


1.24 


1.174 


2.048 



a Values in parentheses are for the outer resolution shell. 

b R cl crBc = 211- <£> \ I 2 I, where I is the observed intensity and <I> is the average intensity obtained 
from multiple observations of symmetry related reflections. 
10 c R CI y S t=E|Fo|-|Fc|/E|Fo| where Fo and Fc are observed and calculated structure factors respectively. 
d R ftcc is obtained for a test set of reflections, consisting of a randomly selected - 4 % of reflections. 
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Table 3 

Kinetic parameters for the hydrolysis of Hip-His-Leu by various tACE glycoforms 

5 





K m {mNi) 


*cat(sec ; 


/Ccat/A-m \acC HiiVl ) 


tACE-wt 


9 7 


iy j 


72 


tACEA36 -el 


1.6 


128 


79 






1 1 1 

ill 


44 


tACE A3 6 -gl3 


2.7 


170 


63 


tACE A3 6 -g2 


inactive 


inactive 


inactive 


tACEA36-gl2 


2.1 


195 


94 


tACE A3 6 -gl23 


2.7 


189 


71 


tACE A3 6 -gl234 


1.5 


85 


56 


tACEA36-NJ' 


2.6 


253 


99 


tACEA36-NJ 2 


4.1 


289 


70 



10 



1 Expressed in CHO cells treated with NB-DNJ. 

2 Expressed in NB-DNJ-untreated CHO cells. 
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Table 4 

Mass Spectral Analysis of C-Terminal Endoproteinase Lys-C Peptides 6 



10 



Peptide 










(residue no. b ) 


tACEA36-gl 


tACEA36g-l 


tACEA36-gl234 


tACEA36NJ 


598-613° 
614-627 d 
614-625 d 


1951.6 (1951.2) 

1689.7 (1690.8) 


1950.6(1951.2) 
1690.5 (1690.8) 


1950.2 (19512) 
1690.9 (1690.5) 


1951.5 (1951.2) 
1463.1 (1463.5) 



a Soluble (shed) ACE proteins were purified from conditioned medium of transfected cells, digested 
with endoproteinase Lys-C and analyzed by MALDI-TOF mass spectrometry. 

15 

b Amino acid residue numbering refers to wild type tACE (Ehlers et al 1989). Shown are the masses of 
the penultimate c and ultimate d C-terminal peptides. All values are calculated for protonated 
isotopically averaged molecular masses m/z. In parentheses are expected masses. 



25 
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Table A 

Co-ordinates of underglycosylated tACEA36NJ ACE 

5 

REMARK coordinates from minimization refinement 
REMARK refinement resolution: 47.14 - 2.0 A 
REMARK starting r= 0.1829 free_r= 0.2206 
REMARK final r= 0.1829 free_r= 0.2208 
10 REMARK rmsd bonds= 0.005706 rmsd angles= 1.22440 
REMARK wa= 0.837519 

REMARK target= mlf cycles= 1 steps= 20 

REMARK sg= P2(1)2(1)2{1) a= 56.621 b= 85.062 c= 133.791 alpha- 90 

beta= 90 gamma= 90 
15 REMARK ncs= none 

REMARK B-correction resolution: 6.0 - 2.0 

REMARK initial B-factor correction applied to fobs : 

REMARK Bll- 2.826 B22= -1.896 B33= -0.930 

REMARK B12= 0.000 B13= 0.000 B23= 0.000 
20 REMARK B-factor correction applied to coordinate array B: 0.207 

REMARK bulk solvent: density level= 0.355826 e/A^, B-factor= 50.0907 

A A 2 

REMARK reflections with | Fobs | /sigma_F < 0.0 rejected 
REMARK reflections with I Fobs | > 10000 * rms(Fobs) rejected 
25 REMARK theoretical total number of refl. in resol . range: 44 472 { 
100.0 % ) 

REMARK number of unobserved reflections (no entry or |F|=0) :3070( 
6.9 % ) 

REMARK number of reflections rejected: 0 ( 0.0 % ) 

30 REMARK total number of reflections used: 41402 ( 93 . 1 % ) 

REMARK number of reflections in working set: 39727 ( 8 9.3 % ) 
REMARK number of reflections in test set: 167 5 ( 3.8 % ) 

CRYST1 56.621 85.062 133.791 90.00 90.00 90.00 P 21 21 21 
REMARK VERSION: 1.1 
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74 37,272 65.799 

74 37.217 64.597 

75 37.482 66.273 
75 37.643 65.395 
75 37.619 66.217 
75 38.917 64.558 

75 38.871 63.343 

76 40.053 65.204 
76 41.319 64.486 
76 42.486 65.468 
76 42.391 66.266. 
76 41.326 63.562 

76 41.948 62.501 

77 40.631 63.974 
77 40.534 63.179 
77 39.924 64.021 
77 39.438 63.219 
77 38.889 64.135 
77 38.072 63.355 
77 36.868 62.741 
77 39.662 61.952 

77 39.966 60.844 

78 38.573 62.163 
78 37.661 61.079 
78 36.495 61.610 
78 35.634 60.527 
78 34.772 59.788 
78 35.700 60.238 
78 33.983 58.776 
78 34.914 59.224 
78 34.055 58.493 
78 38.407 60.018 

78 38.293 58.825 

79 39.174 60.460 
79 39.928 59.539 
79 40.647 60.296 
79 41.588 59.362 
79 39.618 60.868 
79 40.944 58.732 

79 41.109 57.527 

80 41.624 59.390 
80 42.605 58.707 
80 43.302 59.699 
80 44.036 60.832 
80 44.759 61.760 
80 44.149 .62.161 
80 45.934 62.096 
80 41.910 57.627 

80 42.408 56.504 

81 40.759 57.969 
81 40.011 57.013 
81 38.831 57.706 
81 39.227 58.391 
81 38.161 59.326 
81 36.960 58.986 
81 38.531 60.399 
81 39.524 55.860 

81 39.678 54.686 

82 38.948 56.197 
82 38.461 55.179 
82 37.927 55.832 
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32,366 58,055 56.296 1.00 26.64 

34.604 57.892 56.476 1.00 26.33 

34.632 56.439 56.372 1.00 25.57 

36.080 55.970 56.205 1.00 28.20 

36.244 54.497 55.885 1.00 32.16 

37.640 54.228 55.333 1.00 36.26 

37.815 52.773 54.930 1.00 37.26 

37.763 51.868 56.105 1.00 39.38 

33.989 55.754 57.581 1.00 24.69 

33.302 54.739 57.435 1.00 24.36 

34.205 56.307 58.772 1.00 22.37 

33.632 55.727 59.981 1.00 22.59 

34.270 56.341 61.254 1.00 23.63 

33.651 55.723 62.502 1.00 20.79 

35.776 56.099 61.243 1.00 22.23 

32.111 55.899 60.035 1.00 21.68 

31.426 55.151 60.730 1.00 21.12 

31.581 56.871 59.294. 1.00 20.83 

30.133 57.097 59.267 1.00 21.36 

29.804 58.423 58.566 1.00 21.83 

30.424 59.658 59.216 1.00 22.21 

30.025 60.949 58.514 1.00 26.18 

29.786 60.960 57.306 1.00 26.64 

29.969 62.046 59.266 1.00 23.87 

29.405 55.948 58.555 1.00 21.63 

28.178 55.831 58.637 1.00 21.51 
30.162 55.111 57.848 1.00 19.66 
29.596 53.958 57.141 1.00 21.29 
30.413 53.657 55.875 1.00 19.46 
29.783 52.574 55.004 1.00 21.81 
29.034 51.724 55.528 1.00 21.84 
30.056 52.563 53.784 1.00 22.86 
29.690 52.781 58.109 1.00 20.99 
30.772 52.240 58.323 1.00 20.95 
28.558 52.386 58.688 1.00 22.58 
28.535 51.292 59.661 1.00 22.39 
27.330 51.433 60.593 1.00 22.04 

26.824 52.837 60.927 1.00 25.61 
25.697 52.735 61.951 1.00 25.99 
27.956 53.680 61.460 1.00 22.76 
28.460 49.919 59.013 1.00 22.78 
28.512 48.902 59.704 1.00 21.91 
28.341 49.894 57.691 1.00 22.03 
28.202 48.644 56.965 1.00 23.75 
29.540 47.902 56.894 1.00 26.94 

30.179 48.049 55.508 1.00 34.66 
31.604 48.553 55.547 1.00 38.29 
32.505 47.776 55.938 1.00 41.36 

31.825 49.729 55.185 1.00 42.09 
27.103 47.792 57.603 1.00 21.61 
26.055 48.324 57.952 1.00 22.87 
27.323 46.493 57.773 1.00 20.09 
26.275 45.646 58.340 1.00 21.46 
26.742 44.192 58.430 1.00 20.77 
27.718 43.918 59.551 1.00 19.96 
28.186 42.480 59.509 1.00 23.32 
28.950 42.205 58.299 1.00 25.43 
29.456 41.017 57.994 1.00 29-86 
29.274 39.987 58.814 1.00 28.71 
30.152 40.858 56.873 1.00 31.49 
25.784 46.105 59.712 1.00 21.73 
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155 24.638 45.846 

156 26,645 46.787 
156 26.270 47.264 
156 27.478 47.878 
156 25.117 48.270 

156 24.566 48.620 

157 24.748 48.735 
157 23.642 49.684 
157 23.721 50.438 
157 22.297 48.966 

157 21.258 49.598 

158 22.322 47.642 
158 21.103 46.834 
158 21.365 45.441 
158 21.629 45.249 
158 22.085 43.821 
158 20.357 45.554 
158 20.542 46.655 

158 21.296 46.550 

159 19.202 46.616 
159 18.180 46.863 
159 18.582 46.430 
159 17.096 46.342 
159 16,970 47.241 
159 19.122 45.111 

159 19,384 44.182 

160 19.289 45.035 
160 19.818 43.841 
160 19.536 43.909 
160 19.296 42.516 

160 20.071 41.584 

161 17.989 42.431 
161 17.397 41.201 
161 15.871 41.313 
161 15.167 40.022 
161 13.656 40.124 
161 13.114 40.363 
161 12.967 39.947 
161 17.909 40.844 

161 18.313 39.705 

162 17.884 41.815 
162 18.353 41.595 
162 18.056 42.812 
162 16.648 42.872 
162 15.596 43.218 
162 15.349 42.396 
162 15.015 44.320 
162 19.852 41.319 

162 20.338 40.564 

163 20.583 41.933 
163 22.027 41.741 
163 22.658 42.686 
163 24.167 42.535 
163 24.947 42.725 
163 24.614 43.548 
163 22.382 40.299 

163 23.248 39.700 

164 21.717 39.728 
164 22.031 38.356 
164 21.362 37.974 
164 19.950 37.445 
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19.405 37.035 66.171 1.00 40.22 
18.302 36.450 66.224 1.00 41.83 
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24.412 
24.302 
25.551 
25.979 
26.499 
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27.810 
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27.511 
28.217 
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242 
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25 


.478 


36. 


884 


1. 


00 


14. 


78 




ATOM 


1951 


O 


MET 


309 


33. 


996 


25 


.220 


36. 


548 


1. 


00 


13. 


18 


35 


ATOM 


1952 


N 


TRP 


310 


35. 


671 


26 


.703 


36. 


812 


1. 


00 


11. 


51 




ATOM 


1953 


CA 


TRP 


310 


34. 


872 


27 


.843 


36. 


366 


1. 


00 


14. 


25 




ATOM 


1954 


CB 


TRP 


310 


34. 


970 


28 


.968 


37. 


403 


1. 


00 


12. 


20 




ATOM 


1955 


CG 


TRP 


310 


34. 


625 


28 


.480 


38. 


772 


1. 


00 


12. 


52 




ATOM 


1956 


CD2 


TRP 


310 


35. 


538 


27 


.959 


39. 


745 


1. 


00 


12. 


82 


40 


ATOM 


1957 


CE2 


TRP 


310 


34. 


764 


27 


.461 


40. 


817 


1. 


00 


13. 


24 




ATOM 


1958 


CE3 


TRP 


310 


36. 


937 


27 


.855 


39. 


810 


1. 


00 


13. 


78 




ATOM 


1959 


CD1 


TRP 


310 


33. 


373 


28 


.295 


39. 


283 


1. 


00 


11. 


84 




ATOM 


1960 


NE1 


TRP 


310 


33. 


447 


27 


.681 


40.509 


1. 


00 


12. 


66 




ATOM 


1961 


CZ2 


TRP 


310 


35. 


340 


26.866 


41. 


948 


1. 


00 


12. 


55 


45 


ATOM 


1962 


CZ3 


TRP 


310 


37. 


513 


21 


.260 


40. 


934 


1. 


00 


13. 


31 




ATOM 


1963 


CH2 


TRP 


310 


36. 


708 


26 


.773 


41. 


990 


1. 


00 


12. 


52 




ATOM 


1964 


C 


TRP 


310 


35. 


309 


28 


.360 


35. 


001 


1. 


00 


13. 


97 




ATOM 


1965 


O 


TRP 


310 


34. 


772 


29.348 


34. 


508 


1. 


00 


12. 


98 




ATOM 


1966 


N 


ALA 


311 


36. 


269 


27 


.670 


34. 


388 


1. 


00 


14. 


15 


50 


ATOM 


1967 


CA 


ALA 


311 


36. 


799 


28 


.086 


33. 


096 


1. 


00 


13. 


77 




ATOM 


1968 


CB 


ALA 


311 


35. 


719 


27 


.964 


32. 


021 


1. 


00 


14. 


05 




ATOM 


1969 


C 


ALA 


311 


37. 


297 


29 


.534 


33. 


179 


1. 


00 


13. 


58 




ATOM 


1970 


O 


ALA 


311 


37. 


249 


30 


.263 


32. 


189 


1. 


00 


13. 


90 




ATOM 


1971 


N 


GLN 


312 


37. 


761 


29.952 


34. 


358 


1. 


00 


13. 


19 


55 


ATOM 


1972 


CA 


GLN 


312 


38. 


247 


31 


.325 


34. 


541 


1. 


00 


13. 


69 




ATOM 


1973 


CB 


GLN 


312 


38. 


088 


31.770 


36. 


000 


1. 


00 


12. 


28 




ATOM 


1974 


CG 


GLN 


312 


38. 


917 


30 


.995 


36. 


999 


1. 


00 


14. 


52 




ATOM 


1975 


CD 


GLN 


312 


38. 


542 


31 


.318 


38. 


433 


1. 


00 


17. 


57 




ATOM 


1976 


OE1 


GLN 


312 


37. 


382 


31 


.187 


38. 


828 


1. 


00 


15. 


61 


60 


ATOM 


1977 


NE2 


GLN 


312 


39. 


522 


31 


.739 


39. 


220 


1. 


00 


19.96 




ATOM 


1978 


C 


GLN 


312 


39.700 


31 


.485 


34. 


102 


1. 


00 


13. 


47 
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ATOM 


1979 


O 


GLN 


312 


40. 


137 


32. 


587 


33. 


771 


1. 


00 


11.19 




ATOM 


1980 


N 


THR 


313 


40. 


442 


30. 


382 


34. 


124 


1. 


00 


12.33 




ATOM 


1981 


CA 


THR 


313 


41. 


835 


30. 


354 


33. 


687 


1. 


00 


13.83 




ATOM 


1982 


CB 


THR 


313 


42. 


826 


30. 


403 


34. 


875 


1. 


00 


16.20 


5 


ATOM 


1983 


OG1 


THR 


313 


42. 


468 


29. 


425 


35. 


858 


1. 


00 


16.40 




ATOM 


1984 


CG2 


THR 


313 


42. 


819 


31. 


789 


35. 


511 


1. 


00 


17.20 




ATOM 


1985 


C 


THR 


313 


41. 


964 


29. 


040 


32. 


918 


1. 


00 


16.52 




ATOM 


1986 


O 


THR 


313 


41. 


415 


28. 


021 


33. 


335 


1. 


00 


14.73 




ATOM 


1987 


N 


TRP 


314. 


42. 


675 


29. 


065 


31. 


795 


1. 


00 


14.06 


10 


ATOM 


1988 


CA 


TRP 


314 


42. 


788 


27. 


881 


30. 


948 


1. 


00 


15.86 




ATOM 


1989 


CB 


TRP 


314 


42. 


318 


28. 


216 


29. 


529 


1. 


00 


15.32 




ATOM 


1990 


CG 


TRP 


314 


40. 


991 


28. 


934 


29. 


408 


1. 


00 


13.99 




ATOM 


1991 


CD2 


TRP 


314 


40. 


457 


29. 


529 


28. 


219 


1. 


00 


13.70 




ATOM 


1992 


CE2 


TRP 


314 


39. 


164 


30. 


012 


28. 


534 


1. 


00 


14.95 


15 


ATOM 


1993 


CE3 


TRP 


314 


40. 


943 


29.698 


26. 


915 


1. 


00 


12.93 




ATOM 


1994 


CD1 


TRP 


314 


40. 


032 


29. 


079 


30. 


372 


1. 


00 


14.30 




ATOM 


1995 


NE1 


TRP 


314 


38. 


929 


29. 


725 


29. 


854 


1. 


00 


13.72 




ATOM 


1996 


CZ2 


TRP 


314 


38. 


355 


30. 


650 


27. 


592 


1. 


00 


13.70 




ATOM 


1997 


CZ3 


TRP 


314 


40. 


137 


30. 


332 


25. 


979 


1. 


00 


16.03 


20 


ATOM 


1998 


CH2 


TRP 


314 


38. 


853 


30. 


800 


26.325 


1. 


00 


16.16 




ATOM 


1999 


C 


TRP 


314 


44. 


171 


27. 


255 


30. 


816 


1. 


00 


17.18 




ATOM 


2000 


O 


TRP 


314 


44. 


363 


26. 


402 


29. 


950 


1. 


00 


16.26 




ATOM 


2001 


N 


SER 


315 


45. 


125 


27. 


655 


31. 


648 


1. 


00 


16.81 




ATOM 


2002 


CA 


SER 


315 


46. 


482 


27. 


131 


31. 


514 


1. 


00 


18.73 


25 


ATOM 


2003 


CB 


SER 


315 


47. 


429 


27. 


840 


32. 


493 


1. 


00 


19.48 




ATOM 


2004 


OG 


SER 


315 


47. 


078 


27. 


573 


33. 


834 


1. 


00 


23.23 




ATOM 


2005 


C 


SER 


315 


46. 


628 


25. 


618 


31. 


658 


1. 


00 


17.43 




ATOM 


2006 


O 


SER 


315 


47. 


599 


25. 


043 


31. 


165 


1. 


00 


16.20 




ATOM 


2007 


N 


ASN 


316 


45. 


669 


24. 


967 


32. 


309 


1. 


00 


16.52 


30 


ATOM 


2008 


CA 


ASN 


316 


45. 


758 


23. 


526 


32. 


487 


1. 


00 


16.72 




ATOM 


2009 


CB 


ASN 


316 


44. 


827 


23. 


068 


33. 


617 


1. 


00 


16.47 




ATOM 


2010 


CG 


ASN 


316 


45. 


342 


23. 


4 94 


34. 


991 


1. 


00 


18.32 




ATOM 


2011 


OD1 


ASN 


316 


46. 


504 


23. 


262 


35. 


323 


ll 


00 


19.09 




ATOM 


2012 


ND2 


ASN 


316 


44. 


484 


24. 


116 


35. 


788 


1. 


00 


15.86 


35 


ATOM 


2013 


C 


ASN 


316 


45. 


540 


22. 


700 


31. 


219 


1. 


00 


17.50 




ATOM 


2014 


O 


ASN 


316 


45. 


834 


21. 


508 


31. 


210 


1. 


00 


17.87 




ATOM 


2015 


N 


ILE 


317 


45. 


028 


23. 


311 


30.150 


1. 


00 


17.39 




ATOM 


2016 


CA 


ILE 


317 


44. 


869 


22. 


568 


28. 


901 


1. 


00 


17.13 




ATOM 


2017 


CB 


ILE 


317 


43. 


479 


22. 


759 


28. 


246 


1. 


00 


16.82 


40 


ATOM 


2018 


CG2 


ILE 


317 


42. 


394 


22. 


220 


29. 


170 


1. 


00 


16.53 




ATOM 


2019 


CGI 


ILE 


317 


43. 


242 


24. 


230 


27. 


902 


1. 


00 


17.65 




ATOM 


2020 


CD1 


ILE 


317 


42. 


036 


24. 


444 


27. 


019 


1. 


00 


18.86 




ATOM 


2021 


C 


ILE 


317- 


45. 


947 


23. 


014 


27. 


915 


1. 


00 


18.07 




ATOM 


2022 


O 


ILE 


317 


45. 


793 


22. 


892 


26. 


697 


1. 


00 


16.69 


45 


ATOM 


2023 


N 


TYR 


318 


47. 


047 


23. 


536 


28. 


454 


1. 


00 


18.93 




ATOM 


2024 


CA 


TYR 


318 


48. 


155 


23. 


983 


27. 


618 


1. 


00 


20.39 




ATOM 


2025 


CB 


TYR 


318 


49. 


337 


24. 


421 


28. 


479 


1. 


00 


21.07 




ATOM 


2026 


CG 


TYR 


318 


50. 


567 


24. 


757 


27. 


667 


1. 


00 


23.88 




ATOM 


2027 


CD1 


TYR 


318 


50. 


635 


25. 


931 


26. 


917 


1. 


00 


22.79 


50 


ATOM 


2028 


CE1 


TYR 


318 


51. 


749 


26. 


225 


26. 


138 


1. 


00 


25.90 




ATOM 


2029 


CD2 


TYR 


318 


51. 


652 


23. 


879 


27. 


618 


1. 


00 


25.81 




ATOM 


2030 


CE2 


TYR 


318 


52. 


774 


24. 


163 


26. 


841 


1. 


00 


26.76 




ATOM 


2031 


cz 


TYR 


318 


52. 


815 


25. 


335 


26. 


105 


1. 


00 


27.14 




ATOM 


2032 


OH 


TYR 


318 


53. 


919 


25. 


616 


25. 


338 


1. 


00 


27.90 


55 


ATOM 


2033 


c 


TYR 


318 


48. 


609 


22. 


861 


26. 682 


1. 


00 


20.54 




ATOM 


2034 


o 


TYR 


318 


48. 


933 


23. 


108 


25. 


525 


1. 


00 


19.56 




ATOM 


2035 


N 


ASP 


319 


48. 


624 


21. 


634 


27. 


194 


1. 


00 


21.68 




ATOM 


2036 


CA 


ASP 


319 


49. 


041 


20. 


4 68 


26. 


415 


1. 


00 


24.93 




ATOM 


2037 


CB 


ASP 


319 


48. 


942 


19. 


205 


27. 


273 


1. 


00 


27.75 


60 


ATOM 


2038 


CG 


ASP 


319 


47. 


532 


18. 


947 


27. 


770 


1. 


00 


33.89 




ATOM 


2039 


OD1 


ASP 


319 


46. 


943 


19. 


866 


28. 


377 


1. 


00 


37.33 
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ATOM 


2040 


OD2 


ASP 


319 


47.009 


17. 


829 


27.560 


1.00 


36. 


71 




ATOM 


2041 


C 


ASP 


319 


48.214 


20. 


275 


25.144 


1.00 


24. 


42 




ATOM 


2042 


O 


ASP 


319 


48.721 


19. 


790 


24.133 


1.00 


24. 


30 




ATOM 


2043 


N 


LEU 


320 


46.940 


20. 


648 


25.200 


1.00 


22. 


05 


5 


ATOM 


2044 


CA 


LEU 


320 


46.057 


20. 


502 


24.048 


1.00 


21. 


65 




ATOM 


2045 


CB 


LEU 


320 


44.594 


20. 


431 


24.503 


1.00 


21. 


37 




ATOM 


2046 


CG 


LEU 


320 


44.138 


19. 


298 


25.429 


1.00 


20. 


50 




ATOM 


2047 


CD1 


LEU 


320 


42.647 


19. 


458 


25.712 


1.00 


20. 


57 




ATOM 


2048 


CD2 


LEU 


320 


44.414 


17. 


946 


24.780 


1.00 


20. 


05 


10 


ATOM 


2049 


C 


LEU 


320 


46.190 


21. 


626 


23.023 


1.00 


20. 


30 




ATOM 


2050 


O 


LEU 


320 


45.750 


21. 


476 


21.884 


1.00 


20. 


89 




ATOM 


2051 


N 


VAL 


321 


46.803 


22. 


743 


23.411 


1.00 


18. 


20 




ATOM 


2052 


CA 


VAL 


321 


46.911 


23. 


878 


22.497 


1.00 


17. 


14 




ATOM 


2053 


CB 


VAL 


321 


46.014 


25. 


051 


22.986 


1.00 


15. 


95 


15 


ATOM 


2054 


CGI 


VAL 


321 


44.637 


24. 


529 


23.358 


1.00 


16. 


89 




ATOM 


2055 


CG2 


VAL 


321 


46. 648 


25. 


737 


24.182 


1.00 


13. 


25 




ATOM 


2056 


C 


VAL 


321 


48.319 


24. 


,420 


22.267 


1.00 


18. 


07 




ATOM 


2057 


O 


VAL 


321 


48.477 


25. 


.496 


21.697 


1.00 


19. 


68 




ATOM 


2058 


N 


VAL 


322 


49.342 


23. 


,689 


22.699 


1.00 


20. 


47 


20 


ATOM 


2059 


CA 


VAL 


322 


50.715 


24. 


,154 


22.516 


1.00 


23. 


25 




ATOM 


2060 


CB 


VAL 


322 


51.735 


23. 


,050 


22.893 


1.00 


25. 


.43 




ATOM 


2061 


CGI 


VAL 


322 


51.487 


21. 


,801 


22.069 


1.00 


27 . 


.69 




ATOM 


2062 


CG2 


VAL 


322 


53.156 


23, 


.561 


22. 680 


1.00 


27. 


.11 




ATOM 


2063 


C 


VAL 


322 


50,975 


24, 


.620 


21.077 


1,00 


22, 


.80 


25 


ATOM 


2064 


0 


VAL 


322 


50.758 


23, 


.875 


20.123 


1,00 


23. 


.34 




ATOM 


2065 


N 


PRO 


323 


51.432 


25, 


.874 


20.904 


1,00 


23. 


.14 




ATOM 


2066 


CD 


PRO 


323 


51.589 


26. 


.917 


21.934 


1.00 


21, 


.86 




ATOM 


2067 


CA 


PRO 


323 


51.712 


26, 


. 414 


19.566 


1.00 


24, 


.47 




ATOM 


2068 


CB 


PRO 


323 


52.152 


27, 


.848 


19.855 


1.00 


23. 


.04 


30 


ATOM 


2069 


CG 


PRO 


323 


51.422 


28, 


.181 


21.134 


1,00 


22, 


.71 




ATOM 


2070 


C 


PRO 


323 


52.790 


25 


.620 


18.821 


1.00 


26. 


.42 




ATOM 


2071 


O 


PRO 


323 


52.615 


25 


.255 


17.658 


1.00 


26, 


.23 




ATOM 


2072 


N 


PHE 


324 


53.905 


25 


.359 


19.494 


1.00 


27 


.57 




ATOM 


2073 


CA 


PHE 


324 


54.996 


24 


.606 


18.885 


1.00 


28 


.92 


35 


ATOM 


2074 


CB 


PHE 


324 


56.232 


25 


.494 


18.707 


1.00 


29 


.24 




ATOM 


2075 


CG 


PHE 


324 


56.003 


26 


.667 


17.801 


1.00 


28 


.40 




ATOM 


2076 


GDI 


PHE 


324 


' 55.591 


27 


.890 


18.314 


1.00 


28 


.05 




ATOM 


2077 


CD2 


PHE 


324 


56.159 


26 


.538 


16.426 


1.00 


30 


.34 




ATOM 


2078 


CE1 


PHE 


324 


55.333 


28 


.964 


17.473 


1.00 


28 


.58 


40 


ATOM 


2079 


CE2 


PHE 


324 


55.903 


27 


.605 


15.575 


1.00 


28 


.99 




ATOM 


2080 


CZ 


PHE 


324 


55.490 


28 


.823 


16.099 


1.00 


29 


.77 




ATOM 


2081 


C 


PHE 


324 


55.339 


23 


.400 


19.738 


1.00 


29 


.62 




ATOM 


2082 


0 


PHE 


324 


56.221 


23 


.4 60 


20.596 


1.00 


27 


.80 




ATOM 


2083 


N 


PRO 


325 


54.633 


22 


.280 


19.514 


1.00 


31 


.43 


45 


ATOM 


2084 


CD 


PRO 


325 


53.589 


22 


.095 


18.491 


1.00 


32 


.36 




ATOM 


2085 


CA 


PRO 


325 


54.855 


21 


.042 


20.263 


1.00 


34 


.46 




ATOM 


2086 


CB 


PRO 


325 


53.756 


20 


.118 


19.737 


1.00 


33 


.87 




ATOM 


2087 


CG 


PRO 


325 


53.560 


20 


.595 


18.336 


1.00 


33 


.40 




ATOM 


2088 


C 


PRO 


325 


56.257 


20 


.463 


20.083 


1.00 


36 


.18 


50 


ATOM 


2089 


0 


PRO 


325 


56.677 


19 


.593 


20.849 


1.00 


37 


.18 




ATOM 


2090 


N 


SER 


326 


56.980 


20 


.947 


19.077 


1.00 


37 


.49 




ATOM 


2091 


CA 


SER 


326 


58.337 


20 


.473 


18.832 


1.00 


41 


.11 




ATOM 


2092 


CB 


SER 


326 


58.858 


20 


.989 


17.490 


1.00 


42 


.68 




ATOM 


2093 


OG 


SER 


326 


58.218 


20 


.326 


16.416 


1.00 


46 


.06 


55 


ATOM 


2094 


C 


SER 


326 


59.259 


20 


.936 


19.949 


1.00 


41 


.74 




ATOM 


2095 


O 


SER 


326 


60.210 


20 


.246 


20.300 


1.00 


42 


.06 




ATOM 


2096 


N 


ALA 


327 


58.974 


22 


.113 


20.498 


1.00 


42 


.81 




ATOM 


2097 


CA 


ALA 


327 


59.767 


22 


.663 


21.591 


1.00 


44 


.30 




ATOM 


2098 


CB 


ALA 


327 


59.778 


24 


.183 


21.516 


1.00 


44 


.00 


60 


ATOM 


2099 


C 


ALA 


327 


59.138 


22 


.200 


22.903 


1.00 


45 


.75 




ATOM 


2100 


O 


ALA 


327 


58.246 


22 


.857 


23.440 


1.00 


45 


.98 
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ATOM 


2101 


N 


PRO 




ATOM 


2102 


CD 


PRO 




ATOM 


2103 


CA 


PRO 




ATOM 


2104 


CB 


PRO 


5 


ATOM 


2105 


CG 


PRO 




ATOM 


2106 


C 


PRO 




ATOM 


2107 


O 


PRO 




ATOM 


2108 


N 


ALA 




ATOM 


2109 


CA 


ALA 


10 


ATOM 


2110 


CB 


ALA 




ATOM 


2111 


C 


ALA 




ATOM 


2112 


O 


ALA 
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425 46,694 47.324 

425 45.744 46.342 

425 50.354 50.082 

425 49.447 50.758 

426 51.018 49.162 
426 50.678 48.840 
426 51.593 47.727 
426 53.048 48.138 
426 53.883 46.986 
426 55.337 47.396 
426 55.457 48.484 
426 50.708 50.007 

426 50.117 49.914 

427 51.385 51.098 
427 51.473 52.238 
427 52.888 52.816 
427 53.919 51.824 
427 53.697 51.226 
427 54.948 51.644 
427 50.466 53.354 

427 50.487 54.382 

428 49.586 53.163 
428 48.581 54.175 
428 48.159 54.110 
428 49.169 54.334 
428 48.466 54.097 
428 49.731 55.743 
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433 45.026 49.497 
433 45.849 50.225 
433 46.193 49.307 
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707 24.332 47.645 

708 41.493 22.764 

709 57.554 41.574 

710 39.385 24.084 

711 51.141 36.659 

712 53.148 51.275 

713 52.416 34.580 

714 16.540 45.049 

715 47.597 46.867 

716 33.726 54.353 

717 33.205 41.739 

718 37.109 27.771 

719 17.460 40.837 

720 40.996 49.596 

721 37.232 33.021 

722 48.355 52.277 

723 44.253 25.373 

724 37.616 28.428 

725 31.807 45.250 

726 48.929 40.765 

727 43.001 28.525 

728 36.297 33.649 

729 45.223 39.419 

730 19.069 49.921 

731 18.652 48.396 

732 16.355 39.332 

734 27.725 37.511 

735 43.139 43.055 

736 23.288 50.618 

737 40.575 28.7691 

738 44.624 45.246 

739 42.886 25.680 

740 36.434 24.093 

741 39.736 39.843 

742 - 47.972 45.541 

743 48.595 42.613 

744 16.756 44.276 

745 50.833 52.475 

746 43.086 51.380 

747 43.065 37.312 

748 36.377 29.551 

749 31.749 23.341 

750 51.017 32.405 

751 23.902 25.063 

752 46.245 55.817 

753 17.325 38.625 

754 30.477 54.512 

755 30.153 29.098 

756 40.602 56.391 

757 48.469 46.405 

758 , 23.341 22.303 

759 39.109 29.949 

760 41.617 55.292 

761 50.433 32.980 

762 55.349 39.853 

763 17.829 47.302 

764 41.437 43.591 

765 31.453 48.234 

766 35.895 29.226 

767 42.129 47.247 

768 39.938 19.646 
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59 


.107 


55. 


188 


1. 


00 


46 


.96 


32 


.898 


43. 


133 


1. 


00 


29 


.76 


47 


.081 


42. 


243 


1. 


00 


32 


.59 


54 


.350 


56.726 


1. 


00 


53 


.35 


16 


.873 


20. 


860 


1. 


00 


43 


.15 


26 


.154 


61. 


550 


1. 


00 


36 


.80 


36 


.508 


12. 


204 


1. 


00 


33 


.71 


41 


.501 


68. 


026 


1. 


00 


45 


.11 


34 


.803 


51. 


451 


1. 


00 


38 


.53 


38 


.420 


54. 


710 


0. 


50 


29 


.72 


38 


.685 


22. 


833 


1. 


00 


39 


.63 


37 


.523 


50. 


8 94 


1. 


00 


39 


.63 


54 


.366 


25. 


635 


1. 


00 


42 


.49 


25 


.862 


40. 


806 


1. 


00 


24 


.48 


57 


.934 


37. 


750 


1. 


00 


22 


.44 


59 


.362 


48. 


731 


1. 


00 


33 


.75 


57 


.789 


52. 


852 


1. 


00 


40 


.58 


33 


.122 


54. 


732 


1. 


00 


44 


.11 


32 


.265 


27. 


402 


1. 


00 


38 


.80 


17 


.171. 


32. 


905 


1. 


00 


43 


.63 


48 


.121 


27. 


949 


1. 


00 


43 


.58 


59.756 


41. 


894 


1. 


00 


54 


.98 


31 


.175 


68. 


418 


1. 


00 


28 


.64 


33 


.656 


69. 


811 


1. 


00 


40 


.47 


43 


.712 


34. 


681 


1. 


00 


47 


.01 


21 


.093 


19. 


000 


1. 


00 


34 


.17 


33 


.165 


61. 


660 


1. 


00 


40 


.84 


61 


.127 


51. 


030 


1. 


00 


39 


.94 


43 


.982 


59. 


555 


1. 


00 


39 


.42 


41 


.831 


63. 


701 


1. 


00 


42 


.73 


49 


.957 


71. 


786 


1. 


00 


35 


.15 


9 


.964 


32. 


536 


1. 


00 


31 


.00 


21 


.857 


50. 


569 


1. 


00 


43.38 


20 


.267 


50. 


750 


1. 


00 


41 


.28 


9 


.657 


34. 


297 


1. 


00 


39 


.24 


9 


.067 


36. 


007 


1. 


00 


35 


.51 


47 


.272 


35. 


502 


1. 


00 


33 


.10 


58 


.167 


41. 


963 


1. 


00 


39 


.41 


58 


.757 


54. 


952 


1. 


00 


54 


.54 


60 


.561 


60. 


251 


1. 


00 


49 


.35 


37 


.381 


55. 


807 


1. 


00 


42 


.76 


43 


.614 


61. 


340 


1. 


00 


41 


.53 


30 


.515 


55. 


600 


1. 


00 


52 


.02 


10 


.725 


25. 


760 


1. 


00 


26.11 


60 


.192 


39. 


575 


1. 


00 


27 


.92 


44 


.964 


49. 


691 


1. 


00 


25 


.17 


38 


.077 


47. 


755 


1. 


00 


37 


.02 


41 


.927 


47. 


692 


1. 


00 


28 


.52 


43 


.505 


32. 


164 


1. 


00 


33 


.23 


21 


.223 


54. 


177 


1. 


00 


26 


.91 


41 


.142 


51. 


475 


1. 


00 


46 


.99 


61 


.103 


37. 


679 


1. 


00 


41, 


.65 


30 


.169 


45. 


086 


i. 


00 


42, 


.30 


38, 


.583 


24. 


573 


l. 


00 


28, 


.21 


44, 


.155 


50. 


435 


i. 


00 


35, 


.21 


36.178 


62. 


387 


l. 


00 


48, 


.01 


45, 


.800 


61. 


426 


i. 


00 


46, 


.24 



WO 2004/024765 



170 



PCT/GB2003/003966 





ATOM 


5029 


OH2 


WAT 


1144 


56. 


000 


49. 


811 


57. 


192 


1. 


00 


33. 


19 




ATOM 


5030 


OH2 


WAT 


1145 


50. 


931 


47. 


015 


21. 


718 


1. 


00 


41. 


76 




ATOM 


5031 


OH2 


WAT 


1148 


41. 


181 


13. 


664 


41. 


675 


1. 


00 


36.85 




ATOM 


5032 


OH2 


WAT 


1149 


26. 


829 


13. 


900 


42. 


000 


1. 


00 


51. 


60 


5 


ATOM 


5033 


OH2 


WAT 


1150 


38. 


428 


50. 


689 


49. 


680 


1. 


00 


38. 


37 




ATOM 


5034 


OH2 


WAT 


1151 


13. 


307 


43. 


102 


37. 


676 


1. 


00 


40. 


57 




ATOM 


5035 


OH2 


WAT 


1153 


32. 


37 6 


39. 


513 


22. 


416 


1. 


00 


42. 


97 




ATOM 


5036 


OH2 


WAT 


1155 


14. 


863 


50.906 


47. 


090 


1. 


00 


49. 


79 




ATOM 


5037 


OH2 


WAT 


1156 


39. 


040 


11. 


571 


36. 


195 


1. 


00 


42. 


49 


10 


ATOM 


5038 


OH2 


WAT 


1157 


55. 


760 


18. 


990 


54. 


253 


1. 


00 


46. 


82 




ATOM 


5039 


OH2 


WAT 


1159 


29. 


811 


44. 


316 


51. 


775 


1. 


00 


29. 


84 




ATOM 


5040 


OH2 


WAT 


1160 


26. 


254 


30. 


191 


18. 


160 


1. 


00 


47. 


30 




ATOM 


5041 


OH2 


WAT 


1163 


50. 


589 


53.935 


30. 


727 


1. 


00 


41. 


87 




ATOM 


5042 


OH2 


WAT 


1164 


26. 


778 


17 . 


598 


17. 


107 


1. 


00 


39. 


40 


15 


ATOM 


5043 


OH2 


WAT 


1166 


57. 


293 


56. 


491 


51. 


361 


1. 


00 


36. 


42 




ATOM 


5044 


OH2 


WAT 


1167 


55. 


900 


43. 


483 


62. 


013 


1. 


00 


33. 


17 




ATOM 


5045 


OH2 


WAT 


1170 


45. 


416 


56. 


601 


68. 


328 


1. 


00 


39. 


64 




ATOM 


5046 


OH2 


WAT 


1171 


45. 


090 


53. 


796 


27. 


563 


1. 


00 


44 . 


12 




ATOM 


5047 


OH2 


WAT 


1172 


25. 


950 


27 . 


760 


17. 


747 


1. 


00 


47. 


76 


20 


ATOM 


5048 


OH2 


WAT 


1174 


. 21. 


716 


34 . 


736 


25. 


915 


1. 


00 


39. 


70 




ATOM 


5049 


OH2 


WAT 


1177 


28. 


580 


48. 


024 


33. 


877 


1. 


00 


34 . 


88 




ATOM 


5050 


OH2 


WAT 


1178 


45. 


395 


19. 


823 


49. 


333 


1. 


00 


48 . 


96 




ATOM 


5051 


OH2 


WAT 


1180 


66. 


122 


32 . 


218 


43. 


068 


1. 


00 


46. 


26 




ATOM 


5052 


OH2 


WAT 


1181 


61. 


892 


29 . 


586 


16. 


389 


1. 


00. 


40 . 


21 


25 


ATOM 


5053 


OH2 


WAT 


1183 


15. 


358 


27 . 


450 


47. 


492 


1. 


00 


49. 


59 




ATOM 


5054 


OH2 


WAT 


1184 


13. 


556 


47. 


284 


49. 


126 


1. 


00 


53. 


43 




ATOM 


5055 


OH2 


WAT 


1185 


47. 


096 


19. 


628 


33. 


301 


1. 


00 


39. 


63 




ATOM 


5056 


OH2 


WAT 


1186 


15. 


740 


33. 


792 


31. 


449 


1. 


00 


55. 


70 




ATOM 


5057 


OH2 


WAT 


1189 


28. 


976 


26. 


719 


60. 


440 


1. 


00 


42. 


03 


30 


ATOM 


5058 


OH2 


WAT 


1190 


18. 


223 


23. 


257 


49.730 


1. 


00 


49. 


11 




ATOM 


5059 


OH 2 


WAT 


1191 


51. 


509 


56. 


989 


54. 


192 


1. 


00 


38. 


61 




ATOM 


5060 


OH2 


WAT 


1193 


43. 


560 


45. 


405 


17. 


563 


' 1. 


00 


36. 


62 




ATOM 


5061 


OH2 


WAT 


1204 


30. 


518 


43. 


634 


25. 


079 


1. 


00 


47 . 


04 




ATOM 


5062 


OH2 


WAT 


1208 


21. 


476 


24. 


864 


27. 


340 


1. 


00 


34 . 


36 


35 


ATOM 


5063 


OH2 


WAT 


1209 


17. 


760 


41. 


993 


33.261 


1. 


00 


55. 


66 




ATOM 


5064 


OH2 


WAT 


1212 


46. 


798 


63. 


405 


52. 


749 


1. 


00 


51. 


44 




ATOM 


5065 


OH2 


WAT ' 


1219 


34. 


617 


51. 


206 


58. 


460 


1. 


00 


49. 


36 




ATOM 


5066 


OH2 


WAT 


1220 


24. 


104 


41. 


582 


29. 


552 


1. 


00 


45. 


40 




ATOM 


5067 


OH2 


WAT 


1223 


23. 


8 63 


21. 


524 


53. 


424 


1. 


00 


45. 


78 


40 


ATOM 


5068 


OH2 


WAT 


1224 


50. 


956 


18. 


524 


42. 


578 


1. 


00 


40. 


01 




ATOM 


5069 


OH2 


WAT 


1226 


49. 


969 


19. 


920 


49. 


613 


1. 


00 


44 . 


84 




ATOM 


5070 


OH2 


WAT 


1229 


49. 


709 


30. 


628 


33. 


104 


1. 


00 


41. 


21 




ATOM 


5071 


OH2 


WAT 


1230 


31. 


240 


38. 


559 


20. 


375 


1. 


00 


47. 


94 




ATOM 


5072 


OH 2 


WAT 


1232 


23. 


399 


22. 


619 


27. 


338 


1. 


00 


34 . 


56 


45 


ATOM 


5073 


OH2 


WAT 


1233 


48. 


110 


31. 


839 


37. 


064 


1. 


00 


42. 


29 




ATOM 


5074 


OH 2 


WAT 


1236 


47. 


248 


45.427 


20. 


268 


1. 


00 


40. 


26 




ATOM 


5075 


OH2 


WAT 


1237 


19. 


590 


16,455 


39. 


634 


1. 


00 


45. 


25 




ATOM 


5076 


OH2 


WAT 


1239 


40. 


962 


60. 


4 63 


74. 


404 


1. 


00 


45. 


01 




ATOM 


5077 


OH2 


WAT 


1243 


46. 


981 


33. 


979 


68. 


748 


1. 


00 


49 . 


24 


50 


ATOM 


5078 


OH2 


WAT 


1244 


61. 


241 


43. 


880 


58. 


980 


1. 


00 


42 . 


97 




ATOM 


5079 


OH2 


WAT 


1250 


35. 


115 


51. 


055 


23. 


889 


1. 


00 


51. 


05 




ATOM 


5080 


OH2 


WAT 


1255 


12. 


571 


50. 


793 


52. 


116 


0. 


50 


33. 


17 




ATOM 


5081 


OH2 


WAT 


1266 


25. 


841 


9. 


552 


44. 


564 


1. 


00 


55. 


17 




ATOM 


5082 


OH2 


WAT 


1268 


61. 


336 


50. 


541 


51. 


337 


1. 


00 


45. 


61 


55 


ATOM 


5083 


OH2 


WAT 


1273 


23. 


939 


29. 659 


26. 


509 


1. 


00 


42. 


88 




ATOM 


5084 


OH2 


WAT 


1275 


59. 


410 


55. 


437 


44. 


599 


1. 


00 


42. 


16 




ATOM 


5085 


OH2 


WAT 


1278 


64. 


290 


43, 


553 


53. 


436 


1. 


00 


38. 


01 




ATOM 


5086 


OH2 


WAT 


1279 


36. 


463 


61. 


861 


41. 


741 


1. 


00 


31. 


97 




ATOM 


5087 


OH2 


WAT 


1282 


37. 


884 


28. 


719 


70. 


368 


1. 


00 


50. 


75 


60 


ATOM 


5088 


OH2 


WAT 


1286 


21. 


758 


53. 


622 


63. 


516 


1. 


00 


46. 


95 




ATOM 


5089 


OH2 


WAT 


1288 


49. 


866 


23. 


754 


32.094 


1. 


00 


47. 


08 



WO 2004/024765 1?1 PCT/GB2003/003966 





ATOM 


5090 


OH2 


WAT 


1289 


46. 


816 


19. 


829 


38. 


612 


1 


.00 


41. 


12 




ATOM 


5091 


OH2 


WAT 


1294 


41. 


690 


12. 


743 


36. 


520 


1 


.00 


45. 


23 




ATOM 


5092 


OH2 


WAT 


1302 


47. 


382 


23. 


700 


60. 


586 


1 


.00 


42. 


72 




ATOM 


5093 


OH2 


WAT 


1303 


65. 


406 


53. 


318 


38. 


129 


1 


.00 


42. 


46 


5 


ATOM 


5094 


OH2 


WAT 


1304 


16. 


963 


19.384 


30. 


420 


1 


.00 


49. 


48 




ATOM 


5095 


OH2 


WAT 


1308 


63. 


085 


28. 


921 


19. 


779 


1 


.00 


53. 


97 




ATOM 


5096 


OH2 


WAT 


1309 


23. 


293 


21. 


221 


25. 


162 


1 


.00 


40. 


04 




ATOM 


5097 


OH2 


WAT 


1312 


47. 


167 


34. 


527 


37. 


872 


1 


.00 


27. 


27 




ATOM 


5098 


OH2 


WAT 


1314 


10. 


864 


37. 


430 


37. 


980 


1 


.00 


28. 


15 


10 


ATOM 


5099 


OH2 


WAT 


1315 


44. 


856 


27. 


880 


46. 


189 


1 


.00 


30. 


95 




ATOM 


5100 


OH2 


WAT 


1317 


58. 


272 


25. 


539 


24. 


981 


1 


.00 


32. 


58 




ATOM 


5101 


OH2 


WAT 


1318 


18. 


820 


50. 


334 


62. 


005 


1 


.00 


36. 


58 




ATOM 


5102 


OH2 


WAT 


1319 


30. 


841 


37. 


131 


59. 


111 


1 


.00 


46. 


56 




ATOM 


5103 


OH2 


WAT 


1320 


29.617 


62. 


035 


68. 


933 


1 


.00 


44. 


43 


15 


ATOM 


5104 


OH2 


WAT 


1321 


46.544 


35. 


226 


40. 


439 


1 


.00 


40. 


06 




ATOM 


5105 


OH2 


WAT 


1322 


28. 


073 


10. 


853 


25. 


663 


1 


.00 


40. 


56 




ATOM 


5106 


OH2 


WAT 


1323 


42. 


441 


29. 


122 


46. 


515 


1 


.00 


39. 


18 




ATOM 


5107 


OH2 


WAT 


1324 


32. 


503 


25. 


207 


20. 


281 


1 


.00 


30. 


43 




ATOM 


5108 


OH2 


WAT 


1325 


45. 


433 


45. 


115 


68. 


057 


1 


.00 


34. 


28 


20 


ATOM 


5109 


OH2 


WAT 


1326 


60. 


152 


17. 


493 


35. 


180 


1 


.00 


31. 


13 




ATOM 


5110 


OH2 


WAT 


1327 


18. 


398 


27. 


449 


35. 


882 


1 


.00 


37. 


64 




ATOM 


5111 


OH2 


WAT 


1328 


25. 


504 


31. 


112 


20. 


438 


1 


.00 


38. 


34 




ATOM 


5112 


OH 2 


WAT 


1329 


30. 


779 


47. 


918 


51. 


067 


1 


.00 


35. 


49 




ATOM 


5113 


OH2 


WAT 


1330 


19. 


913 


28. 


817 


33. 


764. 


1 


.00 


31. 


28 


25 


ATOM 


5114 


OH2 


WAT 


1331 


41. 


767 


10. 


738 


28. 


096 


1 


.00 


41. 


34 




ATOM 


5115 


OH2 


WAT 


1332 


26. 


547 


23. 


240 


56.068 


1 


.00 


41. 


06 




ATOM 


5116 


OH2 


WAT 


1333 


33. 


925 


58. 


939 


47. 


024 


1 


.00 


34. 


77 




ATOM 


5117 


OH2 


WAT 


1334 


22. 


602 


51. 


105 


36. 


733 


1 


.00 


43. 


04 




ATOM 


5118 


OH2 


WAT 


1335 


22. 


996 


38. 


949 


26. 


371 


1 


.00 


38. 


28 


30 


ATOM 


5119 


OH2 


WAT 


1336 


29. 


162 


20. 


497 


16. 


594 


1 


.00 


43. 


76 




ATOM 


5120 


OH2 


WAT 


1337 


20. 


274 


17. 


838 


34. 


214 


1.00 


44. 


34 




ATOM 


5121 


OH2 


WAT 


1338 


39. 


624 


36.468 


69. 


824 


1 


.00 


40. 


01 




ATOM 


5122 


OH2 


WAT 


1339 


37. 


483 


51. 


842 


59. 


120 


1 


.00 


44. 


97 




ATOM 


5123 


OH2 


WAT 


1340 


24. 


717 


29. 


196 


23. 


868 


1 


.00 


34. 


46 


35 


ATOM 


5124 


OH2 


WAT 


1341 


44. 


042 


21. 


496 


47. 


701 


1 


.00 


33. 


63 




ATOM 


5125 


OH2 


WAT 


1342 


13. 


466 


35. 


511 


57. 


867 


1 


.00 


42. 


68 




ATOM 


5126 


OH2 


WAT 


1343 


55. 


772 


29. 


777 


32. 


992 


1 


.00 


44. 


58 




ATOM 


5127 


OH2 


WAT 


1344 


14.280 


32. 


060 


49. 


325 


1 


.00 


38. 


28 




ATOM 


5128 


OH2 


WAT 


1345 


45. 


443 


50. 


731 


25. 


094 


1 


.00 


46, 


95 


40 


ATOM 


5129 


OH2 


WAT 


1346 


68. 


409 


39. 


510 


47. 


837 


1 


.00 


41. 


69 




ATOM 


5130 


OH2 


WAT 


1347 • 


39. 


778 


58. 


278 


43. 


142 


1 


.00 


32. 


41 




ATOM 


5131 


OH2 


WAT 


1348 


29. 


470 


22. 


597 


20. 


692 


1 


.00 


48. 


08 




ATOM 


5132 


OH2 


WAT 


1349 


24.093 


35. 


444 


64. 


264 


1 


.00 


35.01 


45 


ATOM 


5133 


OH2 


WAT 


1351 


26. 


090 


30. 


612 


28. 


125 


1 


.00 


36.91 


ATOM 


5134 


OH2 


WAT 


1352 


17. 


332 


33. 


459 


57. 


443 


1 


.00 


41, 


72 




ATOM 


5135 


OH2 


WAT 


1353 


56. 


042 


23. 


489 


25. 


469 


1 


.00 


43, 


75 




ATOM 


5136 


OH2 


WAT 


1354 


42. 


787 


43. 


334 


56. 


083 


1 


.00 


40. 


09 




ATOM 


5137 


OH2 


WAT 


1355 


50. 


830 


26. 


069 


35. 


611 


1 


.00 


40. 


02 


50 


ATOM 


5138 


OH2 


WAT 


1356 


38. 


381 


12. 


374 


24. 


731 


1 


.00 


44. 


11 


ATOM 


5139 


OH2 


WAT 


1357 


44. 


282 


44. 


869 


20. 


284 


1 


.00 


42. 


34 




ATOM 


5140 
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Table B 

Co-ordinates of an underglycosylated tACEA36NJ ACE-lisinopril complex 

5 REMARK coordinates from minimization refinement 
REMARK refinement resolution: 47.14 - 2.0 A 
REMARK starting r= 0.1816 free_r= 0.2186 
REMARK final r= 0.1814 free_r= 0.2188 
REMARK rmsd bonds= 0.005621 rmsd angles^ 1.23822 
10 REMARK wa= 0.450786 

REMARK target= mlf cycles= 1 steps= 10 

REMARK sg= P2 (1)2 (1)2 (1) a= 56.472 b= 84.899 c= 133.990 alpha= 90 
beta= 90 gamma= 90 

REMARK anomalous £' f 9 ' library: CNS_XRAYLIB : anom_cu . lib 
15 REMARK reflection file= . . /dmlO_2A. cv 
REMARK ncs= none 

REMARK B-correction resolution: 6.0 - 2.0 
REMARK initial B-factor correction applied to fobs : 
REMARK Bll= 1.410 B22= 0.301 B33= -1.711 
.20 REMARK B12= 0.000 B13- 0.000 B23= 0.000 

REMARK B-factor correction applied to coordinate array B: 0.605 
REMARK bulk solvent: density level= 0.336734 e/A A 3, B-factor= 42.5542 
A^2 

REMARK reflections with | Fobs | /sigma_F < 0.0 rejected 
25 REMARK reflections with | Fobs | > 10000 * rms(Fobs) rejected 
REMARK anomalous diffraction data was input 

REMARK theoretical total number of refl. in resol. range: 84079 ( 
100.0 % ) 

REMARK number of unobserved reflections (no entry or |F|=0):821 ( 
30 5.7 % ) 

REMARK number of reflections rejected: 0 ( 0.0 % ) 

REMARK total number of reflections used: 79258 ( 94 . 3 % ) 

REMARK number of reflections in working set: 76164 ( 90.6 % ) 

REMARK number of reflections in test set: 3094 ( 3.7 % ) 
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89 35.675 44.844 

89 38.317 43.904 

89 37.875 42.762 

90 38.980 44.526 
90 39.224 43.868 
90 39.759 44.873 
90 38.739 45.905 
90 38.805 46.752 
90 37.665 47.586 
90 39.717 46.890 
90 37.593 46.250 
90 36.943 47.257 
90 37.411 48.546 
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90 38.319 48.662 
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90 40.089 41.692 

91 41.175 42.870 
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91 43.810 40.313 
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92 38.940 40.242 
92 39.655 40.108 
92 38.998 40.903 
92 37.797 40.691 
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92 38.731 37.817 

93 38.230 39.824 
93 37.343 39.253 
93 36.609 40.346 
93 35.739 39.782 
93 34.533 39.149 
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95 41.041 35.482 
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95 40.147 33.311 

96 38.715 34.867 
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37.266 33.552 64.984 1.00 23.78 

36.898 32.403 64.724 1.00 23.70 

37.426 34.483 64.0.51 1.00 21.38 

37.162 34.200 62.647 1.00 22.12 

37.224 35.506 61.845 1.00 20.92 

36,706 35.353 60.433 1.00 22.34 

35.908 34.459 60.140 1.00 20.62 

37.144 36.243 59.548 1.00 20.25 

38.192 33.189 62.142 1.00 21.72 

37.863 32.270 61.393 1.00 21.75 

39.437 33.353 62.576 1.00 21.37 

40.502 32.449 62.172 1.00 23.30 

41.856 32.963 62.676 1.00 21.74 

42.995 32.002 62.444 1.00 23.76 

43.871 31.960 61.305 1.00 22.12 

44.772 30.891 61.510 1.00 23.47 

43.979 32.720 60.131 1.00 21.74 

43.390 30.985 63.266 1.00 23.18 

44.456 30.314 62.712 1.00 24.63 

45.772 30.562 60.584 1.00 21.94 

44.973 32.392 59.209 1.00 20.39 

45.856 31.320 59.444 1.00 20.85 

40.251 31.041 62.702 1.00 23.90 
40.306 30.061 61.953 1.00'21.48 
39.963 30.945 63.997 1.00 25.31 
39.716 29.648 64.619 1.00 24.92 
39.393 29.820 66.103 1.00 24.71 
40.545 30.430 66.874 1.00 25.68 
41.690 30.397 66.423 1.00 26.00 

40.252 30.980 68.046 1.00 25.62 
38.594 28.901 63.922 1.00 24.14 
38.627 27.674 63.820 1.00 24.01 
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330 CB ILE 

331 CG2 ILE 

332 CGI ILE 

333 CD1 ILE 

334 C ILE 
O ILE 
N LEU 
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336 
337 
338 
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343 
344 
345 
346 
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CB LEU 
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341 CD2 LEU 
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375 ND2 ASN 

376 C ASN 
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N MET 



377 
378 
379 
380 
381 
382 
383 
384 
385 
386 
387 
388 

389 CG 



CA MET 

CB MET 

CG MET 

SD MET 

CE MET 

C MET 
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N 



MET 
GLN 



CA GLN 
CB GLN 



390 



GLN 
CD GLN 



111 

111 

111 

111 

111 

111 

112 

112 

112 

112 

112 

112 

112 

112 

113 

113 

113 

113 

113 

113 

113 

113 
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114 
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114 

114 

114 
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115 

115 

115 

115 

115 

115 

115 

115 

116 

116 

116 

116 

116 

116 

116 

116 

117 

117 

117 

117 

117 

117 

117 

117 

118 

118 

118 

118 

118 



33.354 

34.278 

33.. 205 

34.494 

32.138 

32.164 

32.247 

32.404 

32.644 

32.720 

33.789 

33.021 

31.178 

31.303 

29.997 

28.754 

27.552 

27.451 

26.281 

27.279 

28.590 

27.976 

29.143 

29.074 

29.403 

28.383 

28.764 

28.850 

28.996 

30.060 

29.773 

31.224 

32.235 

33.547 

34.622 

34.924 

35.903 

36.078 

31.718 

32.132 

30.813 

30.215 

29.223 

29.855 

30.356 

29.827 

29.465 

29.675 

28.586 

27.783 

26.843 

25.382 

25.127 

23.347 

28.638 

28.287 

29.768 

30.653 

31.741 

31.195 



31.318 

32.448 

30.271 

29.560 

33.002 

34.180 

32.635 

33.629 

32.948 

33.872 

34.938 

33.046 

34.529 

35.754 

33.921 

34.680 

33.730 

32.874 

31.905 

33.777 

35.624 

36.685 

35.234 

36.057 

35.218 

34.113 

33.207 

33.644 

31.932 

37.208 

38.336 

36.918 

37.944 

37.318 

37.276 

38.684 

38. 699 

40.092 

38.937 

40.094 

38.472 

39.325 

38.530 

37.950 

38.683 

36.627 

40.459 

41.632 

40.086 

41.044 

40.299 

40.289 

39.995 

40.015 

42.039 

43.212 

41.576 

42.447 

41.625 

40.567 



69.722 

70.166 

70.828 

71.183 

68.334 

68.693 

67.061 

66.007 

64.656 

63.430 

63.643 

62.187 

65.917 

65.907 

65.853 

65.761 

65.669 

64.397 

64.514 

63.179 

66.950 

66.831 

68.095 

69.297 

70.537 

70.801 

71.960 

73.110 

71.658 

69.157 

69.555 

68.587 

68.369 

67.889 

68.960 

69.443 

70.598 

71.099 

67.335 

67 .310 

66.480 

65.458 

64.602 

63.356 

62.502 

63.239 

66.146 

65.847 

67.070 

67.812 

68.764 

68.317 

66.544 

66.451 

68.585 

68.697 

69.106 

69.868 

70.563 

71.509 



1.00 30.98 

1.00 29.81 

00 31.92 

00 31.87 

00 30.85 

00 31.16 

00 29.75 

00 28.55 

00 27.63 



.00 26.14 

.00 22.46 

.00 24.13 

.00 29.28 

1.00 29.00 

1.00 29.16 



00 29.91 
00 29.27 
00 29.32 
00 30.07 
1.00 29.18 
1.00 31.04 
00 30.89 
00 32.01 
00 33.69 
00 35.55 
00 40.14 



00 43.52 
00 44.43 
00 44.83 
00 33.40 
00 32.65 
00 33.87 
00 33.82 
00 36.05 
00 39.80 
00 42.99 
00 45.69 
1.00 48.43 
1.00 32.81 
1.00 30.93 
1.00 32.71 
1.00 33.77 
1.00 34.22 
1.00 35.11 
1.00 35.08 
1.00 37.76 
00 33.07 
00 32.40 
00 33.09 
00 32.98 
00 34.71 
00 39.02 
1.00 41.96 
1.00 41.75 
1.00 31.36 
1.00 29.55 
1.00 31.05 
1.00 31.16 
1.00 34.52 
1.00 38.71 



32.270 39.974 72.402 1.00 41.62 
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149 33.802 55.601 

149 33.088 54.606 
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450 54.473 38.853 36.121 1.00 13.77 

450 52.976 39.149 35.969 1.00 12.17 

450 52.758 40.620 35.637 1.00 9.82 

450 52.398 38.262 34.869 1.00 11.71 

450 56.325 37.165 36.283 1.00 14.53 

450 56.651 36.443 35.339 1.00 15.40 

451 57.212 37.745 37.085 1.00 13.59 
451 58.644 37.540 36.900 1.00 15-77 
451 59.441 38.498 37.789 1.00 14.12 
451 59.381 39.818 37.273 1.00 14.82 
451 59.035 36.098 37.211 1.00 15.90 

451 59.843 35.501 36.501 1.00 16.58 

452 58.448 35.543 38.269 1.00 16.02 
452 58.722 34.169 38.674 1.00 17.43 
452 57.903 33.786 39.933 1.00 16.72 
452 58.108 32.308 40.274 1.00 16.74 
452 58.321 34.657 41.102 1.00 18.21 
452 58.381 33.182 37.554 1.00 18.85 

452 59.101 32.205 37.337 1.00 18.12 

453 57.287 33.453 36.843 1.00 18.69 
453 56.820 32.590 35.755 1.00 18.97 
453 55.402 33.001 35.336 1.00 18.72 
453 54.463 32.712 36.356 1.00 19.55 
453 57.693 32.523 34.506 1.00 18.98 

453 57.559 31.593 33.707 1.00 17.86 

454 58.576 33.499 34.320 1.00 18.89 
454 59.426 33.499 33.138 1.00 19.78 
454 60.385 34.699 33.123 1.00 18.87 
454 61.150 34.722 34.336 1.00 19.79 
454 59.603 35.997 32.975 1.00 18.32 
454 60.239 32.217 33.041 1.00 21.57 

454 60.734 31.707 34.045 1.00 22.35 

455 60.378 31.673 31.822 1.00 21.55 
455 59.824 32.153 30.540 1.00 21.60 
455 61.142 30.440 31.632 1.00 23.12 
455 61.278 30.354 30.113 1.00 22.44 
455 59.972 30.933 29.641 1.00 20.77 
455 62.489 30.489 32.338 1.00 23.94 

455 62.854 29.566 33.064 1.00 23.00 

456 63.216 31.582 32.140 1.00 24.94 
456 64.527 31.726 32.748 1.00 25.77 
456 65;186 33.029 32.289 1.00 28.54 
456 66.708 32.960 32.293 1.00 33.33 
456 67.343 34.308 31.998 1.00 36.84 
456 68.857 34.190 31.983 1.00 39.24 
456 69.341 33.419 33.163 1.00 42.84 
456 64.479 31.679 34.274 1.00 25.66 

456 . 65.334 31.055 34.905 1.00 24.62 

457 63.485 32.332 34.873 1.00 23.14 
457 63.386 32.325 36.326 1.00 22.18 
457 62.312 33.299 36.817 1.00 22.71 
457 62.327 33.495 38.301 1.00 21.95 
457 61.859 32.717 39.307 1.00 24.49 
457 62.959 34.560 38.904 1.00 21.90 
457 62.884 34.429 40.217 1.00 24.67 
457 62.222 33.317 40.488 1.00 24.22 
457 63.067 30.922 36.836 1.00 21.29 

457 63.643 30.470 37.826 1.00 20.13 

458 62.136 30.243 36.170 1.00 21.42 
458 61.764 28.890 36.564 1.00 22.05 
458 60.633 28.355 35.677 1.00 21.29 
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470 62.479 32.197 

470 61.641 32.185 

471 62.180 32.479 
471 60.829 32.832 
471 60.801 33.040 
471 59.772 31.807 

471 58.822 32.140 

472 59.933 30.567 
472 58.974 29.515 
472 59.369 28.213 
472 58.968 28.190 
472 59.374 26.913 
472 59.651 25.908 
472 59.397 26.915 
472 58.790 29.276 

472 57.703 28.901 

473 59.847 29.487 
473 59.740 29.305 
473 61.123 29.172 
473 61.817 27.889 
473 " 61.971 26.749 
473 62.435 27.665 
473 62.939 26.444 
473 62.671 25.866 
473 58.993 30.497 

473 58.305 30.361 

474 59.129 31.660 
474 58.448 32.864 
474 58.886 34.096 
474 60.219 34.644 
474 60.417 34.795 
474 61.063 34.939 
474 56.943 32.705 

474 56.182 33.000 

475 56.525 32.252 
475 55.110 32.059 
475 54.893 31.631 
475 53.398 31.481 
475 55.498 32.680 
475 54.942 34.083 
475 - 54.522 31.009 

475 53.405 31.171 

476 55.270 29.937 
476 54.795 28.894 
476 55.788 27.727 
476 55.621 26.738 
476 54.645 26.785 
476 56.575 25.822 
476 54.598 29.475 

476 53.601 29.183 

477 55.554 30.288 
477 55.474 30.913 
477 56.779 31.646 
477 56.688 32.527 
477 56.384 31.987 
477 56.901 33.898 
477 56.294 32.798 
477 56.813 34.720 
477 56.509 34.167 
477 54.318 31.908 
477 53.541 31.917 
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485 44,690 36.616 

485 43.823 37.382 

486 44.744 36.233 
486 43.753 36.707 
486 44.243 36.506 
486 43.126 36.849 
486 45.487 37.366 
486 45.269 38.863 
486 42.400 36.016 
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487 42.418 34.700 
487 41.176 33.954 
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487 40.390 34.462 

487 39.161 3'4.432 

488 41.110 34.937 
488 40.500 35.426 
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488 41.027 35.846 
488 40.310 34.873 
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488 39.765 36.763 
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490 35.325 37.586 

490 34.257 38.049 
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491 33.837 32.713 
491 35.580 33.757 
491 34.442 32.957 
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491 34.312 37.127 
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492 36.589 38.512 
492 38.046 38.946 
492 38.534 39.643 
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492 35.327 40.391 

493 35.344 40.042 
493 34.482 41.177 
493 34.633 41.544 
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493 33.904 42.801 

493 34.295 44.045 

493 33.630 45.210 

493 32.821 42.751 

493 32.150 43.912 

493 32.561 45.137 

493 31.912 46.285 

493 33.022 40.811 

493 32.254 41.585 

494 32.670 39.604 
494 31.327 39.053 
494 31.319 37.739 
494 30.282 36.622 
494 30.701 35.590 
494 30.192 35.977 
494 30.741 38.834 

494 29.566 39.112 

495 31.544 38.347 
495 31.019 38.055 
495 32.139 37.621 
495 31.549 37.318 
495 32.833 36.376 
495 30.197 39.171 

495 29.047 38.955 

496 30.768 40.360 
496 30.012 41.435 
496 30.954 42.439 
496 31.513 41.909 
496 31.025 40.855 
496 32.426 42.532 
496 29.015 42.125 

496 28.125 42.837 

497 29.143 41.923 
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497 29.702 42.954 
497 29.672 44.459 
497 28.615 45.082 
497 30.844 45.048 
497 26.847 41.804 

497 25.774 42.403 

498 26.961 40.520 
498 25.812 39.717 
498 26.189 38.231 
498 25.057 37.318 
498 24.815 36.788 
498 23.634 36.009 
498 25.480 36.898 
498 24.044 36.851 
498 23.185 36.063 
498 . 23.105 35.343 
498 24.950 36.232 
498 23.773 35.465 
498 25.333 40.151 

498 24.146 40.398 

499 26.252 40.243 
499 25.871 40.643 
499 27.038 40.468 
499 27.106 39.070 
499 27.992 39.005 
499 29.395 39.250 
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499 30.409 38.744 

499 30.171 37.961 

499 31.657 39.000 

499 25.325 42.065 

499 24.395 42.311 

500 25.887 42.996 
500 25.391 44.368 
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501 22.956 43.199 
501 21.650 42.807 
501 21.443 41.731 
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504 20.451 47.479 
504 20.413 46.822 
504 19.775 45.766 
504 21.033 47.396 
504 18.470 46.107 

504 17.615 46.892 

505 18.239 44.805 
505 16.929 44.244 
505 16.590 43.920 

505 15.457 43.528 

506 17.547 44.088 
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35.058 37.105 49.888 1.00 12.34 
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1085 53.699 57.790 

1086 16.670 32.247 

1087 22.296 31.936 

1088 47.914 17.001 

1089 57.985 47.492 

1090 35.749 34.442 

1091 31.476 35.919 

1092 57.549 59.850 

1093 37.244 31.058 

1094 37.707 33.592 

1095 65.341 43.727 

1096 29.571 56.164 

1097 28.460 20.997 

1098 18.695 33.000 

1099 19.269 34.045 

1100 50.893 61.236 

1101 56.400 43.894 

1102 56.721 41.861 

1103 34.429 49.906 

1104 41.246 9.747 

1105 20.981 21.798 

1106 23.739 19.876 

1107 38.769 41.131 

1108 29.718 9.526 

1109 27.449 9.015 

1110 26.205 9.875 

1111 24.669 47.026 

1112 31.450 56.875 

1113 29.193 57.847 

1114 48.270 62.126 

1115 25.062 58.442 

1116 22.740 60.186 

1117 39.213 37.453 

1118 38.121 40.798 

1119 37.346 43.433 

1120 37.001 39.773 

1121 18.157 30.227 

1122 33.762 10.574 

1123 37.474 59.970 

1124 41.061 44.584 

1125 70.442 38.041 

1126 15.491 41.776 

1127 25.212 43.116 

1128 26.383 21.154 

1129 50.740 60.798 

1130 41.735 39.836 

1131 42.701 60.987 

1132 14.659 29.765 

1133 34.102 38.199 

1134 45.354 44.890 

1135 35.915 44.195 

1136 15.867 48.663 

1137 15.232 36.067 

1138 24.046 35.123 
1140 42.820 43.043 

1142 30.164 46.247 

1143 36.787 20.701 

1144 55.849 49.287 

1145 50.753 46.872 

1146 45.737 64.613 

1147 28.723 16.491 
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Sequences 
Seq ID No. 1 
WT-ACE 

SQQVTVTHGTSSQATTSSQTTTHQATAHQTSAQSPNLVTDEAEASKFVEEYD 

RTSQWWNEYAEANWNYNTNITTETSKILLQKNMQIANHTLKYGTQARKFD 

VNQLQNTTIKRIIKKVQDLERAALPAQELEEYNKILLDMETTYSVATVCHPNG. 

SCLQLEPDLTNVMATSRXYEDLLWAWEGWRDKAGRAILQFYPKYVELINQA 

ARLNGWDAGDSWRSMYETPSLEQDLEPvLFQELQPLYLNLHAYVRRALHRH 

YGAQHINLEGPIPAHLLGNMWAQTWSNIYDLVVPFPSAPSMDTTEAMLKQG 

WTPRRMFKEADDFFTSLGLLPVPPEFWNKSMLEKPTDGREVVCHASAWDFY 

NGKDFRTKQCTTVNLEDLWAHHEMGHIQYFMQYKDLPVALREGANPGFHE 

AIGDVLALSVSTPKHLHSLNLLSSEGGSDEHDINFLMKMALDKIAFIPFSYLVD 

QWTlWTlVFDGSriXENYNQEWWSLRLKYQGLCPPVPRTQGDFDPGAKFHIPS 

S VP YIRYF VSFIIQF QFHE ALCQ AAGHTGPLHKCDI YQ S KE AGQRL AT AMKLG 

FSRPWPEAMQLITGQPNMSASAMLSYFKPLLDWLRTENELHGEKLGWPQYN 

WTPNSARSEGPLPDSGRVSFLGLDLDAQQARVGQWLLLFLGIALLVATLGLS 

QRLFSIRHRSLHRHSHGPQFGSEVELRHS 



Seq ID No. 2 
ACE-A36NJ 

LVTDEAEASKFVEEYDRTSQVVWNEYAEANWNYNTNITTETSKILLQKNMQI 

ANHTLKYGTQARKFDVNQLQNTTIKRIIKKVQDLERAALPAQELEEYNKILLD 

METTYSVATVCHPNGSCLQLEPDLTNVMATSRKYEDLLWAWEGWRDKAGR 

AILQFYPKYVELINQAARLNGYVDAGDSWTISMYETPSLEQDLERLFQELQPL 

YLNLHAYVRRALHRHYGAQHINLEGPIPAHLLGNMWAQTWSMYDLVVPFP 

SAPSNTOTTEAMLKQGWTPRRMFKEADDFFTSLGLLPWPEFWNKSMLEKPT 

DGPvEVVCHASAWDFYNGKDFRIKQCTTVNLEDLVVAHFIEMGHIQYFMQYK 

DLPVALREGANPGFHEAIGDVEALSVSTPKHLHSLNLLSSEGGSDEHDINFLM 

KMALDKIAFIPFSYLVDQWRWRVFDGSITJCENYNQEWWSLRLKYQGLCPPV 

PRTQGDFDPGAKFFUPSSVPYIRYFVSFIIQFQFHEALCQAAGHTGPLHKCDIY 

QSKEAGQRLATAMKLGFSRPWPEAMQLITGQPNMSASAMLSYFKPLLDWLR 

TE 

NELHGEKLGWPQYNWTPNS 



